VOLUME 3,NUMBER 1 JANUARY 2005 

ISSN:1548-5390 PRINT,1559-176X ONLINE 




JOURNAL 
OF CONCRETE 



AND APPLICABLE 



MATHEMATICS 



EUDOXUS PRESS,LLC 



SCOPE AND PRICES OF THE JOURNAL 
Journal of Concrete and Applicable Mathematics 

A quartely international publication of Eudoxus Press, LLC 

Editor in Chief: George Anastassiou 

Department of Mathematical Sciences, 
University of Memphis 
Memphis, TN 38152, U.S.A. 
ganastss@memphis . edu 

The main purpose of the "Journal of Concrete and Applicable 
Mathematics" is to publish high quality original research articles from 
all subareas of Non-Pure and/or Applicable Mathematics and its many 
real life applications, as well connections to other areas of 
Mathematical Sciences, as long as they are presented in a Concrete way. 
It welcomes also related research survey articles and book reviews. A 
sample list of connected mathematical areas with this publication 
includes and is not restricted to: Applied Analysis, Applied Functional 
Analysis, Probability theory, Stochastic Processes, Approximation 
Theory, O.D.E, P.D.E, Wavelet, Neural Networks, Difference Equations, 
Summability, Fractals, Special Functions, Splines, Asymptotic Analysis, 
Fractional Analysis, Inequalities, Moment Theory, Numerical Functional 
Analysis, Tomography, Asymptotic Expansions, Fourier Analysis, Applied 
Harmonic Analysis, Integral Equations, Signal Analysis, Numerical 
Analysis, Optimization, Operations Research, Linear Programming, 
Fuzzyness, Mathematical Finance, Stochastic Analysis, Game Theory, 
Math. Physics aspects, Applied Real and Complex Analysis, Computational 
Number Theory, Graph Theory, Combinatorics, 

Computer Science Math. related topics, combinations of the above, etc. In 
general any kind of Concretely presented Mathematics which is 
Applicable fits to the scope of this journal. 

Working Concretely and in Applicable Mathematics has become a main 
trend in many recent years, so we can understand better and deeper and 
solve the important problems of our real and scientific world. 
"Journal of Concrete and Applicable Mathematics" is a peer- reviewed 
International Quarterly Journal. 

We are calling for papers for possible publication. The contributor 
should send three copies of the contribution to the editor in-Chief 
typed in TEX, LATEX double spaced. [ See: Instructions to Contributors] 

Journal of Concrete and Applicable Mathematics(JCAAM) 
ISSN:1548-5390 PRINT, 1559-176X ONLINE. 

is published in January ,April,July and October of each year by 
EUDOXUS PRESS,LLC, 

1424 Beaver Trail Drive,Cordova,TN38016,USA, 
Tel.001-901-75 1-3553 

anastas sioug @ yahoo .com 
http://www.EudoxusPress.com. 

Annual Subscription Current Prices:For USA and Canada,Institutional:Print 
$250,Electronic $220,Print and Electronic $310.Individual:Print $77, Electronic $60,Print 
&Electronic $1 10. For any other part of the world add $25 more to the above prices for 
Print. 



Single article PDF file for individual $8. Single issue in PDF form for individual $25. 

The journal carries page charges $8 per page of the pdf file of an article,payable upon 

acceptance of the article within one month and before publication. 

No credit card payments. Only certified check,money order or international check in US 

dollars are acceptable. 

Combination orders of any two from JoCAAAJCAAMJAFA receive 25% discount,all 

three receive 30% discount. 

Copyright©2004 by Eudoxus Press,LLC all rights reserved.JCAAM is printed in USA. 

JCAAM is reviewed and abstracted by AMS Mathematical Reviews,MATHSCI,and 
Zentralblaat MATH. 

It is strictly prohibited the reproduction and transmission of any part of JCAAM and in 
any form and by any means without the written permission of the publisher.lt is only 
allowed to educators to Xerox articles for educational purposes. The publisher assumes no 
responsibility for the content of published papers. 
JCAAM IS A JOURNAL OF RAPID PUBLICATION 



Editorial Board 
Associate Editors 



Editor in -Chief: 

George Anastassiou 

Department, of Mathematical Sciences 

The University Of Memphis 

Memphis, TN 38152, USA 

tel. 901-678-3144, fax 901-678-2480 

e-mail ganastss@memphis.edu 

www.msci .memphis . edu/~ganastss/ jcaam 

Areas : Approximation Theory, 

Probability , Moments , Wavelet , 

Neural Networks, Inequalities, Fuzzyness . 

Associate Editors : 

1) Ravi Agarwal 

Florida Institute of Technology 

Applied Mathematics Program 

150 W. University Blvd. 

Melbourne, FL 32901, USA 

agarwal @ fit . edu 

Differential Equations, Difference 

Equations, inequalities 



2) Shair Ahmad 

University of Texas at San Antonio 

Division of Math.S Stat. 

San Antonio, TX 78249-0664, USA 

SAHMAD@utsa . edu 

Differential Equations , Mathematical 

Biology 

3) Drumi D.Bainov 
Medical University of Sofia 
P.O.Box 45,1504 Sofia, Bulgaria 
drumibainov@yahoo . com 

Differential Equations, Optimal Control, 
Numerical Analysis, Approximation Theory 

4) Carlo Bardaro 

Dipartimento di Matematica & Informatica 

Universita ' di Perugia 

Via Vanvitelli 1 

06123 Perugia, ITALY 

tel. +390755855034, +390755853822, 

fax +390755855024 

bardaro@unipg . it , 

bardaro@dipmat . unipg . it 

Functional Analysis and Approximation Th., 



19) Rupert Lasser 

Institut fur Biomathematik & Biomertie, GSF 

-National Research Center for environment and 

health 

Ingolstaedter landstr . 1 

D-85764 Neuherberg, Germany 

lasser@gsf . de 

Orthogonal Polynomials, Fourier Analysis, 

Mathematical Biology 

20) Alexandru Lupas 
University of Sibiu 
Faculty of Sciences 
Department of Mathematics 
Str.I.Ratiu nr.7 
2400-Sibiu, Romania 
lupas@ulbsibiu . ro 

Classical Analysis, Inequalities, 
Special Functions, Umbral Calculus, 
Approximation Th ., Numerical Analysis 
and Methods 

21) Ram N.Mohapatra 
Department of Mathematics 
University of Central Florida 
Orlando, FL 32816-1364 

tel. 407-823-5080 

ramm@mail . ucf . edu 

Real and Complex analysis, Approximation Th., 

Fourier Analysis, Fuzzy Sets and Systems 

22) Rainer Nagel 

Arbeit sbereich Funktionalanalysis 

Mathematisches Institut 

Auf der Morgenstelle 10 

D-72076 Tuebingen 

Germany 

tel. 49-7071-2973242 

fax 49-7071-294322 

rana@f a . uni-tuebingen . de 

evolution equations, semigroups, spectral th., 

positivity 

23) Panos M.Pardalos 

Center for Appl . Optimization 
University of Florida 
303 Weil Hall 
P.O.Box 116595 
Gainesville, FL 32611-6595 



Summability, Signal Analysis, Integral 

Equations, 

Measure Th.,Real Analysis 

5) Francoise Bastin 
Institute of Mathematics 
University of Liege 
4000 Liege 

BELGIUM 

f . bastin@ulg . ac . be 

Functional Analysis, Wavelets 

6) Paul L.Butzer 
RWTH Aachen 

Lehrstuhl A fur Mathematik 

D-52056 Aachen 

Germany 

tel. 0049/241/80-94627 office, 

0049/241/72833 home, 

fax 0049/241/80-92212 

Butzer@rwth-aachen . de 

Approximation Th., Sampling Th., Signals, 

Semigroups of Operators, Fourier Analysis 

7) Yeol Je Cho 

Department of Mathematics Education 

College of Education 

Gyeongsang National University 

Chinju 660-701 

KOREA 

tel. 055-751-5673 Office, 

055-755-3644 home, 

fax 055-751-6117 

y jcho@nongae . gsnu . ac . kr 

Nonlinear operator Th. , Inequalities, 

Geometry of Banach Spaces 

8) Sever S.Dragomir 

School of Communications and Informatics 

Victoria University of Technology 

PO Box 14428 

Melbourne City M.C 

Victoria 8001, Australia 

tel 61 3 9688 4437, fax 61 3 9688 4050 

sever . dragomir@ vu . edu . au , 

sever@sci . vu . edu . au 

Math . Analysis , Inequalities , Approximation 

Th. , 

Numerical Analysis, Geometry of Banach 

Spaces, 

Information Th. and Coding 

9) A.M. Fink 

Department of Mathematics 
Iowa State University 
Ames,IA 50011-0001, USA 



tel. 352-392-9011 
pardalos@uf 1 . edu 
Optimization, Operations Research 

24) Svetlozar T.Rachev 

Dept.of Statistics and Applied Probability 

Program 

University of California, Santa Barbara 

CA 93106-3110, USA 

tel. 805-893-4869 

rachev@pstat . ucsb . edu 

AND 
Chair of Econometrics and Statistics 
School of Economics and Business Engineering 
University of Karlsruhe 
Kollegium am Schloss,Bau II, 20 . 12, R210 
Postfach 6980, D-76128, Karlsruhe, Germany 
tel . 011-49-721-608-7535 
rachevglsoe . uni-karlsruhe . de 
Mathematical and Empirical Finance, 
Applied Probability, Statistics and Econometrics 

25) Paolo Emilio Ricci 

Universita' degli Studi di Roma "La Sapienza" 

Dipartimento di Matematica-Istituto 

"G.Castelnuovo" 

P.le A. Moro, 2-00185 Roma, ITALY 

tel. ++39 0649913201, fax ++39 0644701007 

riccip@uniromal . it , Paoloemilio . Ricci@uniromal . it 

Orthogonal Polynomials and Special functions, 

Numerical Analysis, Transforms, Operational 

Calculus, 

Differential and Difference equations 

26) Cecil C.Rousseau 

Department of Mathematical Sciences 
The University of Memphis 
Memphis, TN 38152, USA 
tel . 901-678-2490, fax 901-678-2480 
ccrousse@memphis . edu 
Combinatorics , Graph Th . , 
Asymptotic Approximations, 
Applications to Physics 

27) Tomasz Rychlik 
Institute of Mathematics 
Polish Academy of Sciences 
Chopina 12,87100 Torun, Poland 
T . Rychlik@impan . gov . pi 
Mathematical Statistics, Probabilistic 
Inequalities 

28) Bl. Sendov 

Institute of Mathematics and Informatics 
Bulgarian Academy of Sciences 
Sofia 1090, Bulgaria 



tel. 515-294-8150 

f inkgmath . iastate . edu 

Inequalities, Ordinary Differential 

Equations 

10) Sorin Gal 
Department of Mathematics 
University of Oradea 
Str.Armatei Romane 5 
3700 Oradea, Romania 
galsoguoradea . ro 

Approximation Th. , Fuzzyness, Complex 
Analysis 

11) Jerome A.Goldstein 
Department of Mathematical Sciences 
The University of Memphis, 
Memphis, TN 38152, USA 

tel. 901-678-2484 
jgoldste@memphis . edu 
Partial Differential Equations, 
Semigroups of Operators 

12) Heiner H.Gonska 
Department of Mathematics 
University of Duisburg 
Duisburg, D-47048 
Germany 

tel. 0049-203-379-3542 office 
gonska@inf ormatik . uni-duisburg . de 
Approximation Th . , Computer Aided 
Geometric Design 

13) Dmitry Khavinson 

Department of Mathematical Sciences 

University of Arkansas 

Fayetteville,AR 72701, USA 

tel. (479) 575-6331, fax(479) 575-8630 

dmitry@uark . edu 

Potential Th., Complex Analysis, Holomorphic 

PDE , Approximation Th . , Function Th . 

14) Virginia S.Kiryakova 

Institute of Mathematics and Informatics 

Bulgarian Academy of Sciences 

Sofia 1090, Bulgaria 

virginia@diogenes.bg 

Special Functions, Integral Transforms, 

Fractional Calculus 

15) Hans-Bernd Knoop 
Institute of Mathematics 
Gerhard Mercator University 
D-47048 Duisburg 

Germany 

tel . 0049-203-379-2676 



bsendov@bas . bg 

Approximation Th., Geometry of Polynomials, 

Image Compression 

29) Igor Shevchuk 

Faculty of Mathematics and Mechanics 

National Taras Shevchenko 

University of Kyiv 

252017 Kyiv 

UKRAINE 

shevchuk@univ . kiev . ua 

Approximation Theory 

30) H.M.Srivastava 

Department of Mathematics and Statistics 

University of Victoria 

Victoria, British Columbia V8W 3P4 

Canada 

tel. 250-721-7455 office, 250-477-6960 home, 

fax 250-721-8962 

harimsri@math . uvic . ca 

Real and Complex Analysis, Fractional Calculus 

and Appl . , 

Integral Equations and Transforms, Higher 

Transcendental 

Functions and Appl . , q-Series and q-Polynomials, 

Analytic Number Th. 

31) Ferenc Szidarovszky 

Dept . Systems and Industrial Engineering 

The University of Arizona 

Engineering Building, 111 

PO.Box 210020 

Tucson, AZ 85721-0020, USA 

szidar@sie . arizona . edu 

Numerical Methods, Game Th., Dynamic Systems, 

Multicriteria Decision making, 

Conflict Resolution, Applications 

in Economics and Natural Resources 

Management 



and Geodesy 



32) Gancho Tachev 

Dept. of Mathematics 

Univ. of Architecture, Civil Eng. 

1 Hr . Smirnenski blvd 

BG-1421 Sofia, Bulgaria 

Approximation Theory 



33) Manfred Tasche 

Department of Mathematics 

University of Rostock 

D-18051 Rostock 

Germany 

manf red. tasche@mathematik . uni-rostock . de 

Approximation Th. , Wavelet, Fourier Analysis, 

Numerical Methods, Signal Processing, 



knoop@math . uni-duisburg . de 
Approximation Theory, Interpolation 

16) Jerry Koliha 

Dept . of Mathematics & Statistics 

University of Melbourne 

VIC 30 10, Melbourne 

Australia 

koliha@unimelb . edu . au 

Inequalities, Operator Theory, 

Matrix Analysis, Generalized Inverses 

17) Mustafa Kulenovic 
Department of Mathematics 
University of Rhode Island 
Kingston, RI 02881, USA 
kulenm@math . uri . edu 

Differential and Difference Equations 

18) Gerassimos Ladas 
Department of Mathematics 
University of Rhode Island 
Kingston, RI 02881, USA 
gladas@math . uri . edu 

Differential and Difference Equations 



Image Processing, Harmonic Analysis 

34) Chris P.Tsokos 
Department of Mathematics 
University of South Florida 
4202 E. Fowler Ave., PHY 114 
Tampa, FL 33620-5700, USA 

prof cpt@math . usf . edu, prof cpt@chumal . cas . usf . edu 
Stochastic Systems , Biomat hematics , 
Environmental Systems, Reliability Th. 

35) Lutz Volkmann 
Lehrstuhl II fuer Mathematik 
RWTH-Aachen 
Templergraben 55 

D-52062 Aachen 

Germany 

volkm@math2 . rwth-aachen . de 

Complex Analysis, Combinatorics, Graph Theory 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS,VOL.3,N0.1 ,9-29,2005,COPYRIGHT 2005 EUDOXUS PRESS.LLC 

Hermite-Kampe de Feriet polynomials 
and solutions of Boundary Value Problems 

in the half-space 

Giulia Maroscia 

Universita di Roma "La Sapienza", Dipartimento di Metodi e Modelli Matematici 

per le Scienze Applicate, Via A.Scarpa, 16, 00161 Roma, Italia 

e-mail: gmaroscia@dmmm.uniromal.it 

Paolo E. Ricci 

Universita di Roma "La Sapienza", Dipartimento di Matematica, P.le A. Moro, 2 00185 

Roma, Italia - e-mail: riccip@uniromal.it 

Abstract 

Explicit solutions of multidimensional canonical BVP of parabolic, hyperbolic 
and elliptic type in the half-space are derived by using operational methods and 
series expansions in terms of the Hermite-Kampe de Feriet polynomials. 
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1 Introduction 

In a preceding article [5] , the two-dimensional polynomials considered by Hermite, and 
subsequently studied by P. Appell and J. Kampe de Feriet [1], H.W. Gould and A.T. 
Hopper [12], (see also: [16], p. 76, Eq. 1.9 (6)), G. Dattoli et al. [7], were stressed in 
order to obtain explicit solutions of all the canonical (parabolic, hyperbolic or elliptic) 
BVP in the half-plane. 

For shortness, in the following, we will use the abbreviation H-KdF to denote the 
Hermite-Kampe de Feriet polynomials. 

In the present article, by using the same operational approach recalled in [6], we 
present here some generalizations of the results obtained in [5] to the case of the solution 
of all the canonical BVP in the half-space. 

According to our results, the two-dimensional H-KdF polynomials appear as the 
natural tool for representing the solutions of all these problems. 

It is worth noting that the use of operational methods in this framework naturally 
leads to the consideration of fractional operators. However in all the considered cases, it 



10 G.MAROSCIA.P.RICCI 



is possible to exploit the peculiarity of problems in order to avoid fractional derivatives, 
so that the relevant solutions are always expressed in terms of suitable series expansions 
for which a general convergence criterion is proved. 

The convergence conditions on the data expressed by this criterion are not necessary. 
It should be interesting to find more general conditions to be satisfied by the data in 
order to ensure convergence for the formal expansions, but the main motivation of this 
article is to show the relevance of the H-KdF polynomials in the explicit solutions of 
the considered problems. 

We recall that, in the two-dimensional case, the H-KdF polynomials are closely re- 
lated to the classical Hermite ones, and consequently to the Gauss' normal distribution 
which is the weight of the Hermite polynomials. 

We conjectured in [5] that this was the hidden motivation of the possibility to 
express the solutions of all the canonical BVP. 

In any case, the importance of the H-KdF polynomials in the framework of the 
multidimensional or multi-index special functions was recently recognized (see e.g. [7], 
[4], [10]), and their importance even in the solution of BVP seems to be another 
property which is worthy to be noted (see e.g. [5], [2], [15]). 

2 Hermite-Kampe de Feriet polynomials 

We recall the definitions of the H-KdF polynomials, in the two-dimensional case. 
Definition 2.1 The H-KdF polynomials in two variables H$(x,y) are defined by 

[ml 

[ 2 J n m— 2n 

H£\x,y):=m\Y, -J -— (2.1) 

£t nl{m — 2n)\ 

Definition 2.2 The H-KdF polynomials in two variables H$(x,y) are defined by 

[™1 

H^(x,y):=m\Y: ~l "tt (2.2) 

^ n\[m-jn)\ 

In a number of articles by G. Dattoli et al., (see e.g. [7], [8], [9]), by using the 
so called monomiality principle, the following properties for the two-variable H-KdF 
polynomials H^(x,y), j>2 have been recovered. 



Operational definition 

di ( 

dxi 



H^(x,y) = e^x m =(x + jy— z - 1 ) (1). (2.3) 
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Generating function 

oo ±n 

Y,H«\x,y)- = e* t+ y tJ . (2.4) 



„=o n - 



In the case when j = 2, (see [19]), the H-KdF polynomials Hffl(x,y) admit the 
following 



Integral representation 



1 Z'+OO ( x _£) 2 

H%>(x,y) = —— / re'^dt (2.5) 

2Jwy. 



oc 



which is a particular case of the so called Gauss- Weierstrass (or Poisson) transform 
[18]. 

3 Hyperbolic and circular functions of the derivative opera- 
tor 

The two-variable H-KdF polynomials allow us to define, in a constructive way, the 
hyperbolic and circular functions of the derivative operator. 

Definition 3.1 For any j > 1 

oo 

cosh(yDi)f(x) := £ a m K%(x,y), (3.1) 

m=0 

where 

JC$(x,y) = Ey(HV>(x,v)) (3.2) 

and Sy(-) denotes the even part, with respect to the y variable, of the considered 
H-KdF polynomial. 

Definition 3.2 For any j > 1 

oo 

smh(yDi)f(x) := £ a m S$(x,y) (3.3) 

where 

SV>(x,y) = O v (H<>>(x,yj) (3.4) 

and Oy(-) denotes the odd part, with respect to y, of the considered H-KdF polynomial. 

The properties of the hyperbolic and circular functions of the derivative operator, 
and the relevant generalization to the pseudo-hyperbolic and pseudo-circular functions 
(see [11], [17], [3]), were considered in [6]. 
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4 Convergence results 

In this section we will recall an uniform estimate, with respect to j, for the convergence 
of series involving the H-KdF polynomials H^\x, y): 

oo 

£ a n H^(x,y). (4.1) 

n=0 

Theorem 4.1 For every j > 2, — oo < x < +oo ; — oo < y < +oo ; n = 0, 1, 2, . . ., 
£/ze following estimate holds true: 

\H { J\x,y)\ <n\ exp {\x\ + \y\} . (4.2) 

The proof is derived by the same method used in the book of Widder [19], p. 166, 
for the case j '• = 2. A deep analysis of the convergence condition for the series (4.1), in 
the case j = 2, is performed in this book, however the relevant estimates can be only 
partially extended to the general case, since many of them are based on the integral 
representation (2.5). Unfortunately, if j > 2, an integral representation generalizing 
the Gauss- Weierstrass transform is known only when j = 2q, q being an odd number 
(see [13], [14]). 

oo 

Considering, as boundary (or initial) data, analytic functions f(x) = Y^ a n x n 

71=0 

whose coefficients a n tend to zero sufficiently fast, we can prove the following theorem: 

Theorem 4.2 Suppose there exists a number a > 1, such that the coefficients a n 
satisfy the following estimate: 

M = O (-^) , (4.3) 

\n a n\/ 

then, for every j , the series expansion (4.1) is absolutely and uniformly convergent in 
every bounded region of the (x,y) -plane. 

Proof. The result immediately follows from the estimate 

oo 

i#'W)|<e |x|+ll ' l E l a > ! > ( 44 ) 

n=0 

considering that the last series is convergent by condition (4.3). 

Remark 4.1 The condition (4.3) include analytic functions with polynomial growth 
at infinity, but not the exponential function e x , whereas, in the case j '• = 2, the book 
of Widder [19] includes all functions belonging to the so called Huygens class H°. 



oo 


oo 


E a n H^(x,y) 


< Yl Kl 


n=0 


n=0 
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Remark 4.2 Obviously the above convergence conditions are sufficient, but not nec- 
essary, and it should be interesting to extend it in order to include wider classes of 
boundary (or initial) data. However, by the point of view of the Applied Analysis, the 
considered conditions cover many realistic situations, since data are usually expressed 
by bounded functions, vanishing at infinity, and negligible outside a suitable bounded 
interval. 

In the following we show explicit solutions of the canonical BVP of parabolic, hy- 
perbolic or elliptic type problems, starting from the three-dimensional case, in or- 
der to introduce our results in a more friendly way. The relevant extensions to the 
n-dimensional case are almost straightforward, since the demonstration methods are 
exactly the same. Consequently, the general results are given without proofs. 

5 The two-dimensional heat equation 

We start considering the two-dimensional heat equation. Assuming that the initial 
condition q(x, y) is expressed by an analytic function, and using an operational ap- 
proach, we find the explicit solution in terms of the H-KdF polynomials. A convergence 
result is given generalizing the Theorem 4.2. 

Theorem 5.1 Consider the two-dimensional heat equation 

dS d 2 S d 2 S . , , ir 

— — = — — + -7— - in the halt — space t > 0, 

ot ox 2 ay 2 ' {h\\ 

S(x,y,0) = q(x,y) 

with analytic initial condition. Then, the operational solution is given by: 

S(x,y,t) = e t ^ +D yk(x,y), (5.2) 

d d 

where, by definition: D x = — — and D v = — — . 

ox ay 

Proof. We have, indeed: 

dt \dx 2 dy 2 J ' dx 2 dy 2 

and the initial condition, when t — 0, is trivially satisfied. 
Considering the Taylor expansion of q(x,y): 

00 
q(x,y)= J2 a a x ai y a * (5.3) 

n=0 

|a|<n 
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(where we used the multi-index notation: a = (cti, 0:2), \ct\ '■— ol\ + ol-i), it is possible 
to write down explicitly the solution (5.2). 
Consider first the case when q(x,y) = x p y q . Then: 

e t(Dl+Dl) xPyq = ( x + 2tD x )P(y + 2tD y ) q (l) = {x + 2tD x ) p H^{y, t) = 



£ [, )Hf] k {x,t){2tD x ) k Hf\y,t) = H*\x,t)H?\y,t) 



fc=0 



.A' 



where we used the Hausdorff and the Burchnall identity (see [7]). 

Consequently, for the analytic initial condition (5.3), the solution will be expressed by: 



S(x, y, t) = e^+^S) £ a a x" 1 y 



«1 „,«2 



(5.4) 



n=0 

|a|<n 



]T a a (x + 2tD x r(y + 2W y r*(l) = ]T a a H™(x,t) H%(y,t). 



n=0 

\ct\<n 



n=0 

\a\<n 



A sufficient condition for the convergence of the expansion (5.4) is expressed by the 
following result, generalizing Theorem 4.2: 

Theorem 5.2 Suppose that there exist numbers £ x > 1, £ 2 > 1 such that the coeffi- 
cients c 7 of the series expansion 



£ *,!£%> (x,t)H$(y,t) 



satisfy 



n=0 

\j\<n 








C == 





V7i 6 7i! 


1 


72S2! 



(5.5) 



(5.6) 



i/ien £/ie series (5.5) zs absolutely and uniformly convergent in every bounded region 
of the (x, y, t) -space. 

Proof. Using the estimate (4.2) of Theorem 4.1: 

H ( j\x,t) <n!exp{|z| + \t\} , 

we deduce: 



£ OrH^x^H^t) 



?1=0 

|7|<n 



< £ |c 7 | |# 7 ?0M)# 7 ?(^) < 2 |c 7 | 7l ! 72 !e 



,|x|+|»|+2|t| 



ra=0 
|7|<n 



n=0 
|7|<n 



so that, since by hypothesis the last series is convergent, the series expansion (5.5) 
satisfies the Weierstrass test in every bounded region of the (x, y, t)-space. 
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6 The general heat equation 

Denote by x the (n — l)-dimensional vector x = (x±, x 2 , ■ ■ ■ , i n _i), and consider the 
corresponding (n — l)-dimensional heat equation, with analytic initial condition q(x). 
Putting A := d 2 jdx\ + d 2 jdx\ + . . . + d 2 /dx 2 l _ 1 , the result of the preceding section 
can be easily generalized as follows. 

Theorem 6.1 The operational solution of the problem 

OS 

—- = AS, in the half — space t > 0, 

dt (6.1) 

. S(x,0) =q(x), 

is given by 

S(x,t) = e tA q(x). (6.2) 

If q(x) = Xi 1 ■ ■ ■ x^ 1 , then the explicit solution of the problem (6.2) is given by 

^(T t\ - P fA T Ql . . . T 1 *"- 1 — 

= (xi + 2tD xl ) ai ■ ■ • (x n _i + 2tD Xn _ 1 ) an ~ 1 (1) = 
= H^(x 1 ,t).-.H^_ l (x n . 1 ,t). 

For a general analytic initial condition 



<?(z) = J2 a aX a , 



n=0 
\a\<n 



the explicit solution is expressed by the series expansion (6.1): 



S(x,t) = e tA Y, a a x a = 

n=0 

\a\<n 

oo 

= E«« (*i + 2tD Xl ) ai ■ ■ ■ (xn-i + 2tD Xn _ 1 ) an " (1) = 

n=0 
\a\<n 

oo 

= J2 a c t H^(x 1 ,t)---H^_ l (x n . 1 ,t). (6.3) 

ra=0 
\a\<n 

A convergence condition is given by the theorem 
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Theorem 6.2 // there exist n — 1 numbers £, > 1, (i = 1, 2, . . . , n — 1), such that 
the coefficients of the series (6.3) satisfy the condition: 

\a a \=0[-^ — ^ J (6.4) 

i/ien £/ie series is absolutely and uniformly convergent in every bounded region of the 
(x, t) space. 

7 The canonical hyperbolic problem in R 3 

In this section we find explicit solutions for the canonical hyperbolic problem in R 3 , 
with analytic initial conditions. Convergence results are also derived. 

Theorem 7.1 Consider the hyperbolic problem in R 3 ; with analytic initial data q(x, y) 
and v(x,y): 

d 2 S d 2 S d 2 S , , 1P 

— — = t-it + -^—r in the hall - space t > 0, 

ot z ox 2 ay 2 (7 1) 

d ' 

S(x, y, 0) = q(x, y), fo S ( x ' V> °) = w ( x ' ^)- 

T/ie operational solution of the problem (7.1) zs <jwen 6y: 

/ d 2 d 2 \^\ ( ( d 2 d 2 Y 2 

S{,V ■ !J - f)=c "* h \ t [~fa? + dy 2 ) ) q ( X,y ^ + Sinh [ t [dx^ + dy^l I "'(•'■■ -V) (72) 

/ d 2 d 2 \~^ 
where we put, by definition: w(x,y) := -7—7 + 7-— v(x,y). 

\ox 2 dy 2 J 

Proof. The initial conditions are trivially satisfied, furthermore, by differentiating two 
times with respect to t equation (7.2), we find: 

ds ( d 2 d 2 \ l , / ( d 2 d 2 VA f , 
m = [dx- 2 + df) smh [ t W 2 + d?) ) q{x,y) + 

( d 2 d 2 \^ '[ ( d 2 d 2 \ r . 

+ [d^ 2 + df cosh VW + w ]w{x ' y) 



d 2 S / d 2 d 
+ 



dt 2 \ dx 2 dy 2 j 

d 2 S d 2 S 
dx 2 dy 2 



cosh 1 * (L 2 + h ) J g( "' y) + sinh v (h + h ) ' w(x ' y) 



HERMITE-KAMPE DE FERIET POLYNOMIALS... 



17 



By using the Taylor expansion of the initial data: 



Q(x,y)= J2 a aX ai y a \ v(x,y) = J2 bpx^y 132 



(7.3) 



n=0 

|a|<n 



n=0 

|/9|<n 



where a :— («i, a 2 ), P — (A.,/^), and |a| := a^ + a 2 , \P\ '■= Pi + P2, we try to write 
down explicitly the solution of problem (7.1). 

To this aim, we consider first the operational identity: 



— —Yx' 

dx 2 dy 2 I 



V = E 



m ^ m\ ( r) \ 2fc / r) \ 2 ( m ~ fc ) 



fc=o 



k I \dx I \dy 



x p y q 



d 2 d 2 

where we used the commuting property of the operators — ^- and — — 

ox 2 ay 1 

By using the identities: 



(7.4) 



dx 



2 A: 



X/ 



p{j> — 1) • • • ip — 2k + l)x 



p-2k 



J_! x P~-2k 



ip-2k)V 



for p — 2k > 0, or k < 






2(m-k) 



q(q — 1) • • • (q — 2{m — k) + l)y' 



q—2(m—k) 



(q-2(m-k))\ 



y 



q—2(m—k) 



for q — 2(m — k) > 0, or m < | + k < 



+ 



, we find the following equation: 



d 2 d 2 \ m 



.?" 



V = E 



m\ pi 



q\ 



k=0 



kj (p-2k)\ (q-2(m-k))\ 



xP -2k y q-2(m-k) ^ 



\ dx 2 dy 2 

where m < | + | (and, by definition, the terms for which m — k < or m — k> 
must be replaced by zero). The above equation (7.5) can be written in the equivalent 
form: 



dx 2 dy 2 J 



x p y q 



E 



ml pi 



, i+ ,, in k 1 \k 2 \{p-2k 1 )\{q-2k 2 )\ 

0<fci<[f] 

0<fc 2 <[§] 



t x p-2ki q-2k 2 



(7.6) 
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We use now the identity (7.6) in order to represent the solution of the problem (7.1) 
when the initial data are expressed by the monomials: 

q(x,y) = x p y q , v(x,y) = x h y r . 

In this case, we find: 



cosh t 



d 2 



d 



2 \ 2 



oo i2m 



[■2\ H £2m 



dx 2 dy 2 J 

E 



x p y q 



m! 



E 



t i 



o 2 



o 2 



= ' (2m)! \dx 2 dy 2 
p\ q\ 



x p y q 



£ ( 2 ™) ! fel+ f =™ hW (P ~ 2h)\ (q - 2k 2 )\ 

0<fci<[f] 
0<fc 2 <[§] 



x P-2ki q-2k^ 



L 5 ™ t m m\ 

m= 



- (2m) 



jr. 



£ kil(p-2k 1 ] 



q\ 



fki p—2ki V^ _ 

! k ^ k 2 \(q-2k 2 ) 



_-f-k 2 q-2k 2 



[§] + [§J ,,„ 



= E 



t m m! 



[Hj»(x,t)-H<»(y,t) 



(7.7) 



- (2m)! 
where the symbol: H^ 2 \x, t) ■ HJfUy, t) stands for the homogeneous part of degree 

P ' Jm 

m, with respect to the t variable, contained into the product: (H^ 2 \x,t)-H^ 2 \y,t)j, 
i.e. in equation (7.7) we have to erase all the terms for which the exponent of t 
appearing in the product (H^ 2 \x,t)-H^ 2 \y,t)) is different from the index m (as a 
matter of fact, the condition k\ + k 2 = m must hold true). 
Proceeding in an analogous way we find: 



sinh t 



d 2 



d 2 



dx 2 dy 2 



x h y r 



E 



j2m+\ 



d 2 d 2 
+ 



q (2m + 1)! \dx 2 dy 



x h f 



dx 2 dy 2 



i \!± 

a \ 2 



[l]+[i 



E 



t 



2m+l 



' (2m + 1)! h ^ =m h\k 2 \ (h - 2h)\ (r - 2A; 2 )! 



E 

ki+k 2 =m 
0<fei<[|] 

0<*2<[i] 



/;/! 



h\ 



V ! 



x h-2ki r-2k 2 



'8P_ 

dx 2 



8 2 \3L»™ t m+l m | 

2 J 2.^ 



dy 



- (2m + 1)! 



/i! 



,§,*i!(*-2*i)! 



iV~ 2fel ]T 



■y* I 



fc ^ fc 2 !(r-2A; 2 )! 



t k2 y r-2k 2 



8 2 8 2 N ^ 

+ 



§[fM 



<9x 2 <9y 2 



E 



2j t m+l m | 

' (2m +1)! 



Hj?\x,t) ■ Hi 2 \y,t) 



(7- 
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where the symbol H^ (x, t) ■ H^(y, t) denotes the homogeneous part of degree to, 

with respect to the t variable, contained into the product: (H^ (x,t) ■ H^(y,t)). 
As a consequence, in this case, the solution of the problem (7.1) becomes: 



( qi qi \ 2 \ ( d 2 d 

*<■■■■ " ' > - ™* I « ^ + a?) xV + sinh K& 5 + W 



-'. \ i\ (pa pp \-\ 



d 2 d 2 
+ 



dx 2 dy 2 



x h y r 



hr I2J ,2m 



E 



E 



m! 



p! 



q\ 



(2m)! fei+ ^ =m ^!A: 2 ! (p - 2*x)! fa - 2A; 2 )! 

0<*i<[f] 

0<fc 2 <[§] 



x p-2k ly q-2k 2 + 



l + i 



+ E 



t 



2m+l 



E 



m! 



/i! 



y -. 1 



' (2to + 1)! fci+ ^ =m fcxlfe! (/i - 2*0! (r - 2A; 2 )! 

0<fci<[|] 

0<*2<[i] 



x h-2k! r-2k 2 



UJ + I.2J + m 



E 



t m m! r 



o (2m)! 



HJfi(x,t).HW(y,t)+ £ 



!] + [§] t m+l m] 



m ^ (2TO + 1)! 



HP(x,t).HP(y,t) 



Consider now the case of general analytic data (7.3). Then the solution of the 
problem (7.1) becomes: 



pp Pj2 \ 2 \ OO 

^, 2/ ,t) = cosh(t( — + — J j x> Q * ai y a2 + 

|o|<n 
2 £)2 \ 2 \ / pp pp \ ~ 2 oo 



- 1 - smh | t —, + — - 
\ ox 2 ay 2 



d 2 d 2 
+ 



dx 2 dy 2 



E bxhy* 



n=0 

\0\<n 



x; a Q cosh it ( i^ + i^V i •<•'"- .</ ,2 + 



n=0 

|ct|<ra 



9a; 2 ch/ 2 



oo ( / pp Q 2 \^ 

dx 2 ' ch/ 2 / ^ I \<9x 2 <9t/ 2 / ' 

|/3|<n 



/ <9 2 <9 2 ^ x 
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oo [ 2 J + [ 2 \ i2m 

E a <* E 72^vr E 

„=0 m=0 V /• k 1 +k 2 =m 

H^ n 0<fcx<[^L] 

0<k 2 <[f} 



ml ail 



h\k 2 \ (e*i - 2^)! (a 2 - 2k 2 )\ 



"-' ai-2fei a 2 -2k 2 _i_ 



oo [ 2 J + [ 2 J ^2m+l 

n=0 m=0 

l/9|<n 



to! ft! 



ft! 



Y^ ""• Hi-- 

(2m +1)! fei+ tl m fci!fc 2 !(A-2fci)!'(A, - 2A; 2 

0<fci<[^] 
0<fc 2 <[^] 



_ x Pl-2k x B 2 -2k 2 



2 I ■ L 2 J t m m . 



oo 

E °« E ,-,; W 

n=0 m=0 ^ Zm ^ 

|a|<n 



HW(a;,t)-H£)(y,t) 



f£Li + r^2i 

OO [ 2 J^l 2 J t m + 1 m \ 

l/3|<n 



H%\x,t)-H%(y,t) 



•h 



(7.9) 



where we used the same notations of equations (7.7)-(7.8). 

A convergence condition for the series expansions in equation (7.9) is expressed by 
the theorem: 

Theorem 7.2 Suppose there exist numbers & > 1 and pi> 1, i — 1,2 snc/i £/ia£ 
£/ie coefficients a a and bp of the series in equation (7.9) satisfy the condition: 



\a a \ = O 



1 1 



«!«!! Q.2 CK2!, 



IM = O 



1 1 



/3f A! /3r/? 2 ! 



(7.10) 



i/zen £/ie series are absolutely and uniformly convergent in every bounded region of the 
(x,y,t) -space. 

Proof. We consider the first of the series expansions in equation (7.9), since the result 
can be proved in the same way for the second one. 
By using the estimate: 



a 2\ + [ a 2 2 } t m ml 

h (2m]! 



^ \t\ m m\ 
~ ^ n (2m)! 

m=U V / 



since 



m! 1 

< 



(2m)! to! 
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we find: 



ffl + lf 

E 

m=0 



t m m\ 



(2m)! 



OO | + |™ 

<E73TT= ex P{N}- 



m=0 



[my. 



Furthermore, by exploiting the estimate (4.2) for the H-KdF polynomials, we find: 

H^(\x\, \t\) ■ H®(\y\, \t\)\ < ai \a 2 \ exp{\x\ + 2\t\ + \y\} 
We have so derived the estimate: 



OO 

E k 

n=0 

\a\<n 



^\+m 



v t m m\ 



Hjg(x,t)-H<g(y,t) 



< exp{|x| +3|t| + \y\} J^ |a Q |ai!a 2 ! 



ra=0 
|a|<n 



so that, since by hypothesis the last series are convergent, the series expansion (7.9) 
satisfies the Weierstrass test in every bounded region of the (x, y, £)-space. 

We conclude as follows: supposing that the coefficients of the analytic initial data of 
problem (7.1) satisfy conditions (7.10), then the solution of the same problem can be 
expressed by the series expansion (7.9), in terms of the H-KdF polynomials, the series 
being absolutely and uniformly convergent in every bounded region of the (x, y, t) -space. 

8 The hyperbolic problem in R n 

Putting: 

X_ \%lj %1i ■ ■ ■ i •fcn—Xji 

we introduce the multi-index notation: 

a := (ai,. .. , a n _i) 

n— 1 

|a| := Y^ a i 

i=l 



and furthermore we set: 



A: = 



d 2 d 2 



d 2 



dx\ dx\ dx 2 l _ 1 

The results of the preceding section can be generalized as follows: 
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Theorem 8.1 Consider the hyperbolic problem in R n ; with analytic initial data q(x) 

and v(x): 

d 2 S 

——- = A S in the half — space t > 0, 

W d (8.1) 

S(x,0) = q(x), —S(x,0)=v(x). 

The operational solution of the problem (8.1) is given by: 

S(x, t) = cosh (*A5) g(x) + sinh (*A^ iy(x) (8.2) 

where, by definition, we put: w(x) := A~2-y(x). 

Considering the analytic initial data, in compact notation: 

oo oo 

sfe) = S a «^ a ' u (^) = J2 b p^ ( 8 - 3 ) 

r=0 r=0 

\a\<r \fi\<r 

and generalizing equation (7.6): 

A m x a = V — -^ - t ai \ „ gr~ 2fcl -7 ^ xT^n-I 1 " 2 *"- 1 (8.4) 

" fcl+ ...^-i=m fei!---*n-i!(ai-2A;i)! * (a n _ 1 -2fc„_ 1 )! » * l ; 

0<ifc;<fe],j=l:n-l 

where m < Y%=i if > we find the explicit solution in the form: 



S(x,£) = J2 «« cosh(tA^)^ a +A"5 ^ fy, s inh(tA5 



,2l/ 



r=0 r=0 

\a\<r \p\<r 



oo Z-rt=iL 2 J i2m 
/ . Oj(y / 777 77 / ~ ; ; ~ 7 77 77<^1 '" ~ ~ ~X r 



E V""* ^" ^1' ai — 2fci <^n— 1- a n _i— 2fc n _i 

n (2m)!, V fci!---fcn-i! («i-2A; 1 )! ;ri "" (a n _i - 2A; n _ 1 )! ;r ' 1 - 1 

H^ r 0<ki<[%],i=l:n-l 



e::^ 



t 2m+1 ys m\ ],! .,,_,,, i 

_ m=0 \^ m + 1 )- fel+ ... +fcn _ 1=m 



+E^E ?^ — n, E — -^ — ■ - yi ~- ^ 1 ' 2k1 - ^ — ^f- 1 - 2 *"- 1 



(2m + 1)! ki+ ...X_ i=m h\ ■ ■ ■ fcn-i! (A - 2*0! ! (A»-i - 2fc B _ 1 )!' 

0<fcj<[§-l,i=l:n-l 
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U=i L^J t m m \ 



E«« E ^K'^.tl-eK-^) 



|a|<r 



y^«-i [ft 
oo ^ i=1 [ 2 J +m+l \ 

+ Y,h E jr-^fti,*)"'^^^ , (8-5) 



r=0 m=0 

|fl|<r 



(2m + 1)! 



where H$(xi,t) • • • HJ® (x n -i,t) denotes the homogeneous part of degree m, 

1 ™ ' Jm 

with respect to the t variable, contained into the product \H^ (x\, t) ■ ■ ■ H^_ t (x n _i, t) 

The Theorem 4.2 in the n-dimensional case becomes: 

Theorem 8.2 // there exist numbers ^ > 1, pi > 1, (i — 1,2, . . . ,n — 1), such that 
the coefficients a a and bp of the series expansions in equation (8.5) satisfy: 

|aa|=0 (^'"a B _ 1 ^a B _ 1 !)' M=0 {^/--(3 n -A^) ^ 

then the two series are absolutely and uniformly convergent in every bounded region of 
the (x,t) -space. 

9 The Dirichlet problem for the Laplace equation in R 3 

In this section, putting A := d 2 /dx 2 + d 2 /dy 2 + d 2 /dz 2 , we find explicit solutions for 
the Dirichlet problem in R 3 , with analytic boundary conditions. Convergence results 
are also derived. 

Theorem 9.1 Consider the Dirichlet problem for the Laplace equation in the half- 
space z > 0, with analytic boundary condition q(x,y), on the real axis z = 0: 

{AS(x, y, z) = 0, in the half — space z > 0, 
S(x,y,0) = q(x,y) . 

The operational solution of the problem (9.1) is given by: 

( d 2 d 2 \ 2 
S(x,y,z)=cos\zl— + —j }q(x,y) (9.2) 
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Proof. Note that: 

as 

dz 



dx 2 dy 2 ) 



( d 2 d 2 * 
S11) i z ^~^ + ^~^ 

V ox 2 ay 1 



q(x,y) 



d 2 s 

dz 2 



d 2 d 2 
+ 



d 2 d 2 



v <9x 2 dy 2 

/<9 2 ,S d 2 S" 
+ 



cos 2 



(9a; 2 dy 2 



q(x,y) 



y dx 2 dy 2 J 

and the boundary condition, for z = 0, is trivially satisfied. 

It is possible again to express the explicit solution of the problem (9.1), when the 
boundary condition is given by the analytic function: 



Q(x,y) = E a aX ai y a2 , 

n=0 
\a\<n 

where the multi-index notation is used. 
Let us consider first the case when 

q(x,y) =x p y q . 

Using equation (7.5) we find the explicit solution in the form: 



( d 2 <9 2 \ f 
S{*,V,z) = cos | *l — + — 1 



x p y q 



E(-i) 



Jim 



(2m)! 



d 2 d 2 



dx 2 dy 2 



2ni 



x p y q 



[iMl] 

E (-i) r 



z 2m 4-1 ( m\ p\ 



(9.3) 



m=0 



s + § 



(2m)! h'AkJ (p-2k)\ (q-2(m-k))\ 



x P~2k q-2(m-k) 






HW(x,z)-HW(y, 



(9.4) 



where the symbol H^ 2 \x,z) ■ H^(y,z) denotes the homogeneous part of degree 



m, 



with respect to the z variable, contained in the product (H^(x, z) ■ H^(y, 
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i.e. all the terms whose index m is different from the exponent of z must be replaced 
by zero. 

Note that, in order to obtain the last equation, we decomposed first z 2m in the form 
z 2m = z m ■ z kl ■ z k2 (k\ + k 2 = m), so that, by using (7.6) which is equivalent to (7.5), 
it follows: 



[IMfJ 



E (-i) ? 

m=0 



■_? V m - P- z fcl p-2fci ?• k 2 q-2k 2 

( 2m ) l kl £;= m k ^ l (p- 2k ^ (?-2fe)! y 

0<*i<[|] 



0<fc 2 <[ 



^ fcl x p_2fcl 



Using the definition of the H-KdF polynomials: 
we find: 



H ' ){x - z)=p % a mp-2 kl y. 



■i\ z k 2 yq-2k 2 



k 2 =0 



k 2 \(q-2k 2 )\ 



[§]+[§] 



ra=0 



(2m)! 



P!E 



-J z ki T p—2ki 



L2J k 2 q-2k 2 



S,fci!(p-2fc 1 )! ,i - j ^ ) Af 2 !((z-2fc 2 )! 



so that equation (9.4) holds true. 

In the general case, when the boundary condition is expressed in the form (9.3), the 

explicit solution becomes: 

( d 2 d 2 \^\ °° 
S{x,y,z) = cos \z\ — + —j I Y<a a x ai y a2 = 

\a\<n 



E 

n=0 
|o|<n 



a a cos 1 z 



dx 2 dy 2 



x ai y a2 



m+m 



^1 

z 2m [ A J (m\ «i! 



a 2 ! 



00 

V a V (-l) m - V 

n =o a h { ' (2m)to\kJ(a 1 -2k)\(a 2 -2(m-k))\ 

\a\<n 



^+Pfl 



z ml 



00 

n=0 m=0 l Zm ^- 

|a|<n 



H${x,z)-H!g(y,z) 



ai-2fe a 2 -2(m-k) 



(9.5) 



A convergence condition, for the series expansion in equation (9.5), is given by the 
theorem: 
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Theorem 9.2 // there exist numbers £1 > 1, £2 > 1 such that the coefficients a a 
of the series (9.5) satisfy the estimate: 



\a„\ = O 



1 



1 



^aftti! af 2 a ^- 



6, 



(9.6) 



i/ien £/ie series (9.5) is absolutely and uniformly convergent in every bounded region of 
the (x,y, z) -space. 

Proof. 

The result follows from the estimates: 



E 

m=0 



(2m)! 



<E 



00 \ y\tn 



m=0 



\m)\ 



exp{>|}. 



H<®(\x\, \z\) ■ HJ»(\y\, \z\)\ < ai \a 2 \ exp{\x\ + 2\z\ + \y\}, 

hence the series (9.5) is dominated by 

00 
exp{|x| + 3\z\ + \y\} J^ |a Q |o;i!a2! 

n=0 

|a|<n 

which is convergent under the considered hypotheses, so that the series (9.5) satisfies 
the Weierstrass test in every bounded region of the (x, y, z)-space. 

We conclude: supposing that the coefficients of the analytic boundary condition 
(9.3) of the Dirichlet problem (9.1) satisfy (9.6), then the solution of the same problem 
can be expressed by the series expansion (9.5), in terms of the H-KdF polynomials, 
the series being absolutely and uniformly convergent in every bounded region of the 
(x,y, z)- space. 



10 The Dirichlet problem for the Laplace equation in R n 

Consider now the n-dimensional Laplace operator A := d 2 / 'dx\+d 2 / 'dxl+ . . .+d 2 /dx 2 l , 
in the half-space x n > 0, and the Dirichlet problem with boundary condition on 
the real x n axis, expressed by the analytic function q(x), where we put again 

X_ . \X±, • • • , X n ^ 1). 

Then: 

Theorem 10.1 The operational solution of the problem 



A n S(x, x n ) — 0, in the half — space x n > 0, 
S(x, 0) = q(x). 



10.1) 



HERMITE-KAMPE DE FERIET POLYNOMIALS... 27 



is given by 

S(x, x n ) = cos (x n A2 J q(x) (10-2) 

If the boundary condition is expressed by the function: 

oo 

r=0 

\a\<r 

(where we used again the compact multi-index notation), then the explicit solution in 
terms of the H-KdF polynomials is: 

oo oo oo 2m 

S(x,x n ) = cos (i n Ai) E«^ a =E fl «E (-1)™^A"V* = 

|a|<r |ct| <r" 



z:::m 



Jim 



_\p yv i\m X n \p m - a l- ai-2fci Q n-1- q n _i-2fc n _i 

~h ^o } {2m)\ ki+ ^ n _^k l \...k n . 1 \{a 1 -2k l )\ 1 '"(a^- 2^)! - 1 



^pn-lro^l 

oo L~ii = \ L 2 J m 

= E«« E (-irT^y^^i,^)---^!,^-!,^^. (io.3) 

|a|<r 

A convergence condition, for the series expansions in equation (10.3), is given by the 
theorem: 

Theorem 10.2 // there exist n — 1 numbers ^ > 1, (i — 1, 2, . . . , n — 1), stic/i t/iat 
the coefficients a a of the series (10.3) satisfy the estimate: 

K\ = O f-^-, y\ ,) , (10.4) 

then the series (10.3) are absolutely and uniformly convergent in every bounded region 
of the (x, x n ) -space. 
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Abstract. Some properties of generalized m-accretive mappings in Banach 
spaces are proved in this paper. We also introduce and study a new class 
of variational inclusions with generalized m-accretive mappings and construct 
a new algorithm for approximating the solution of the variational inclusions 
with generalized m-accretive mappings in q-uniformly smooth Banach spaces 
by using the resolvent operator technique. 

2000 Mathematics Subject Classification. Primary 47H06; Secondary 
49H40. 

Key words and phrases. Generalized m-accretive mapping, variational 
inclusion, resolvent operator, iterative algorithm, convergence. 

1. Introduction 

Variational inequality theory, as a very effective and powerful tool of the current 
mathematical technology has been widely applied to mechanics, physics, optimiza- 
tion and control, economic and transportation equilibrium, and engineering sci- 
ences, etc. (see [l]-[6], [8]-[13], [15] and the references therein). Because of its wide 
applications, the classical variational inequality has been generalized in various di- 
rections for the past years. Variational inclusion is an important generalization of 
variational inequality and has been studied by many authors. We also know that 
one of the most important and interesting problems in the theory of variational in- 
equality is the development of an efficient and implcmcntablc algorithm for solving 
various variational inequalities and variational inclusions. In recent years, many 
numerical methods have been developed for solving various classes of variational 
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inequalities and variational inclusions in Euclidean Spaces or Hilbcrt spaces, such 
as the projection method and its variant forms, linear approximation, descent, and 
Newton's methods. However, few iterative algorithms have been developed for 
solving variational inequality and inclusion problems in Banach spaces. 

Recently, Huang and Fang [7] introduced the concept of a generalized m-accretive 
mapping, which is a generalization of an m-accretive mapping, and gave the defi- 
nition of the resolvent operator for a generalized m-accretive mapping in a Banach 
space. 

Motivated and inspired by the results in [2], [7] and [9], in this paper, we prove the 
Lipschitz continuity of the resolvent operator for a generalized m-accrctivc mapping 
in Banach spaces and construct a new iterative algorithm for solving a new class of 
variational inclusions with generalized m-accretive mappings in g-uniformly smooth 
Banach spaces by using the resolvent operator technique. 

2. Preliminaries 

Let X be a real Banach space with the dual space X* and (•, •) be the dual pair 
between X and X* . The generalized duality mapping J q : X — > 2 X is defined by 

J q (x) = {f* eI*:(x,f)HNr and ||/*|| = Nl*" 1 }, x e X, 

where q > 1 is a constant. In particular, J 2 is the usual normalized duality mapping. 
It is known that, in general, J q (x) = ||a;|| 9 ~ 2 J 2 (x) for all x e X (x ^ 0) and J q is 
single- valued if X* is strictly convex (see, for example, [14]). If X = H is a Hilbcrt 
space, then J 2 becomes the identity mapping of H. In the sequel, we shall denote 
the single- valued generalized duality mapping by j q . 

The modulus of smoothness of X is the function px '■ [0, oo) — ► [0, oo) defined by 

Px (t) = sup {l(||a; + y\\ + \\x - y\\) - 1 : ||x|| < 1, \\y\\ < i}. 
A Banach space X is said to be uniformly smooth if 

lim **M = 0. 
t-»o t 

The Banach space X is said to be q-uniformly smooth if there exists a constant 

c > such that 

px < ct q , q>l. 

All Hilbert spaces, L p (or l p ) spaces, 1 < p < oo, and the Sobolev spaces 
W m,p , 1 < p < oo, are all q-uniformly smooth. 

In the study of the characteristic inequalities in q-uniformly smooth Banach 
spaces, Xu [14] proved the following theorem: 

Theorem X. Let X be a real uniformly smooth Banach space. Then X is q- 
uniformly smooth if and only if there exists a constant c q > such that, for all 

x, y e x, 

\\ x + y\\i<\\ x \\i + q( y ,j q ( x )}+ Cq \\y\\ q . 

Definition 2.1. A multivalued mapping A : X — > 2 X is said to be 

(1) accretive if, for any u, v g X, there exists j%{u — v) € J 2 (w — v) such that 

(x-y,j 2 (u-v)) > 0, x e Au, y <G Av, 
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(2) strictly accretive if, for any u, v G X, there exists ^(u — v) G J2( M — w ) such 
that 

(x — y, J2(u — v)} > 0, i G Am, i/ e in, 

and the equality holds if and only if u = t), 

(3) strongly accretive if, for any u, v G X, there exists ji(u — v) G J2(w — v) such 
that 

(a; — y,J2(u — v)) > j\\u — v\\ 2 , x G -Att, y G Aw, 

where 7 > is a constant, 

(4) m-accretive if A is accretive and (/ + XA)(X) = X for all (equivalently, for 
some) A > 0, where I denotes the identity mapping. 

Remark 2.1. (1) In Definition 2.1, we can replace ji by j q for all q > 1 and the 
strongly accretive constant 7 is independent of q. 

(2) If X = X* = H is a HUbert space, then (1)— (4) of Definition 2.1 reduce to the 
definitions of monotonicity, strict monotonicity, strong monotonicity and maximal 
monotonicity, respectively. 

Definition 2.2. A mapping T : X — > X is said to be Lipschitz continuous if 
there exists a constant k > such that 

||Tu-Tu|| < fe||u-v||, u,ue X. 

3. Generalized to- Accretive Mappings 

Definition 3.1. A mapping n : X x X ^ X* is said to be 

(1) monotone if 

(u — v, T](u, v)) > 0, u, v € X, 

(2) strictly monotone if 

(u — v, T)(u, v)} > 0, u, v € X, 

and the equality holds if and only if u = w, 

(3) strongly monotone if there exists a constant <5 > such that 

(u — v, rj(u,v)) > S\\u — v\\ , u,«eA, 

(4) Lipschitz continuous if there exists a constant r > such that 

11^(^,^)11 < t||w — v\\, u, v G X. 
Note that the strong monotonicity of n implies the strict monotonicity of n. 

Definition 3.2. ([7]) Let 77 : X x X — > X* be a single-valued mapping and 
M : X — > 2 X be a multivalued mapping. Then M is said to be 

(1) r\- accretive if 

(x — y, w{u, v)) > 0, u, w G X, x G M«, y G M», 

(2) strictly rj-accretive if 

(x — y, r/(u, v)) > 0, u, w G X, x G Mu, y G Aft), 

and the equality holds if and only if u = t), 

(3) strongly n-accretive if there exists a constant r > such that 

(x — y, r](u, v)) > r\\u — v\\ 2 1 u, v G X, x G Afti, t/ G Aft), 
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(4) generalized m-accretive if M is 77-accretive and (I + XM)(X) = X for all 
(cquivalcntly, for some) A > 0. 

Remark 3.1. (1) In [7], Huang and Fang introduced the concept of generalized 
m-accretive mappings in Banach spaces and gave an example of the generalized 
m-accretive mapping and some properties of the generalized m-accretive mapping. 

(2) If X = X* = H is a Hilbert space, then (l)-(4) of Definition 3.2 reduce to 
the definitions of 77-monotonicity, strict 77-monotonicity, strong 77-monotonicity and 
maximal ?7-monotonicity, respectively. 

(3) If X is uniformly smooth and rj(x, y) = Ji{x — y), then (l)-(4) of Definition 
3.2 reduce to the definitions of accretivity, strict accretivity, strong accretivity, and 
m-accretivity in uniformly smooth Banach spaces, respectively. 

Now we prove some properties of the generalized m-accretive mapping in Banach 
spaces. 

Theorem 3.1. Let 77 : X x X — ► X* be strictly monotone and M : X — > 2 X be 
a generalized m-accretive mapping. Then the following conclusions hold: 

(1) (x—y, r/(u, v)) > for all (y, v) G Graph(M) implies that (x, u) G Graph(M) , 
where Graph(M) = {(x,u) G X x X : x G Mu}. 

(2) The inverse mapping (I + AM) -1 is single-valued for all A > 0. 

Proof. Suppose that (1) is false. Then there exists (x 0} u ) $■ Graph(M) such 
that 

(3.1) (x - V, Viuo, v)) > 0, (y, v) e Graph(M). 

Since M is generalized m-accretive, we have (I + \M)(X) = X. Then there exists 
(xi,ui) G Graph(M) such that 

(3.2) u\ + \x\ = uq + Aa;o- 
It follows from (3.1) and (3.2) that 

A(x - X!,Tl(u , Ui)> = (iti - Uq, T](u , Ul)) > 0. 

This implies that 

(u -«i,Jy(«o,Wi)) < 0. 

Since n is strictly monotone, we have uq — u\ and hence, from (3.2), we have 
x\ = xq. This contradicts the fact (xo,uo) ^ Graph(M). Thus (1) is true. 

Now we prove (2). For any given z G X and a constant A > 0, let u, v G 
(7+ AM)" 1 ^). Then \- 1 (z-u) e M(u) and \- 1 (z-v) e M(v). By ^-accretivity 
of M, we obtain 

= (z — z,r)(u,v)) 

= \(-(z — U) — ~(z — v),7](u,v)) + (ll— V,T)(u,v)) 
A A 

> (u — v,r)(u, v)). 

Since r\ is strictly monotone, we have u = v. Thus (I + AM) -1 is single- valued. 
This completes the proof. 



GENERALIZED M-ACCRETIVE MAPPINGS... 35 



Based on Theorem 3.1, we can define the resolvent operator for a generalized 
m-accretive mapping M as follows: 

J™(z) = (I + pM)-\z), zeX, 

where p > is a constant and i):IxX-> X* is a strictly monotone mapping. 

Theorem 3.2. Let r\ : X x X — ► X* be strongly monotone and Lipschitz con- 
tinuous with constants 5 > and t > 0, respectively, and M : X — > 2 X be a 
generalized m-accretive mapping. Then the resolvent operator J^ 1 for M is Lips- 
chitz continuous with constant t/S, i.e., 

(3.3) \\jM{u)-jf{v)\\< T -\\u-v\\, u,v€X. 



Proof. Let u, v be any given points in X. From the definition of Jr , we have 
J™{u) = (I + pM)-\u), JM(v) = (I + pM)-\v), 



which imply that 



- p (u- J™{u)) G M{jf{u)), l -{v - Jf{v)) G M{J™{v)). 
Since M is 77-accretive, we obtain 

- p (u- J™(u) -(v- jM{v)), V {jf{u),jM{v))) 

= - p {u-v- (J p M («) - J?(v)), r,(J p M ( U ), JM{v))) 
>0. 
From the above inequality, we have 



< 



(u-v,r,(J?(u),J?(v))) 



<t\\u-v\\-\\J™(u)-J™(v)\\, 
which implies that 

\\J^(u)-J^(v)\\<^\\u-v\\, u,vex. 
This completes the proof. 

4. Variational Inclusions 

In this section, we shall study a new class of variational inclusions with gener- 
alized to- accretive mappings in Banach spaces. We also construct a new iterative 
algorithm for approximating the solution of this class of variational inclusions with 
generalized TO-accrctivc mappings in ^-uniformly smooth Banach spaces by using 
the resolvent operator technique. 

Let r\ : X x X — > X* and A : X — > X be two single-valued mappings and 
M : X — > 2 X be a generalized m-accretive multivalued mapping. Now we consider 
the following problem: 
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Find u G X such that 

(4.1) OeA(u)+M(u). 

Remark 4.1. (1) If X = X* = H is aHilbert space and n(u, v) = u—v, then the 
problem (4.1) becomes the usual variational inclusion with a maximal monotone 
mapping M . 

(2) If X = X* = H is a Hilbert space, n(u, v) = u — v and M = dip, where dip 
denotes the subdiffercntial of a proper convex lower semicontinuous function ip on 
H, then the problem (4.1) becomes the following classical variational inequality: 

Find u G H such that 

(Au, v — u) + p(v) — p(u) > 0, v G H. 

Lemma 4.1. For given u G X , u is a solution of the problem (4-1) if and only 

if 

u = Jp 4 (u- pAu), 

where p > is a constant. 

Proof. The proof directly follows from the definition of J^ 4 . 

Lemma 4.2. Let {a n }, {b n } and {c n } be three sequences of nonnegative real 
numbers satisfying the following conditions: 
There exists a positive integer no such that 

(4.2) a n+1 < (1 - t n )a n + b n t n + c„, n > n , 
where 

oo oo 

t n €[0, 1], y^t„ = +oo, lim b n = 0, y^c„<+oo. 

A — » n — >oo * — » 

n=0 n=0 

Then a n — > as n —> +oo. 

Proof. Let er = inf{a„ : n > no}. Then a > 0. Suppose that <r > and so 
a « > & > for all n > n . It follows from (4.2) that 

(4.3) a n+1 < a n - at n + t n b n + c n 

.1 1 

= a n - (-er - o„)£„ - -crt„ + c„ n > n . 

Since 6„ — > as n — > oo, there exists ni > no such that 

(4.4) 2 Cr - & ™' n ^ n i- 
Combining (4.3) and (4.4), then we have 

1 
a n +i <a n - -at n + c„, n>n x , 

which implies that 

1 oo oo 

ro=ni n-ni 

This is a contradiction. Therefore, a = and so there exists a subsequence {a n . } 
of {a n } such that a„. — > as j — > oo. It follows from (4.2) that 

a ri,+l < On^ + b n -t n . + C n . 
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and so a„.+i — > as j — > oo. A simple induction leads to a n+ k —> as j — ► oo for 
all k > 1 and this means that a n — > as n — > oo. This completes the proof. 

Theorem 4.1. Lei X be a q-uniformly smooth Banach space. Let ij:IxI-> 
X* be a strongly monotone and Lipschitz continuous mapping with constants S 
and t, respectively, A : X —> X be a strong accretive and Lipschitz continuous 
single-valued mapping with constants r and s, respectively, and M : X —> 2 X be a 
generalized m-accretive multivalued mapping. Assume that there exists p > such 
that 

(4.5) t(1 - rqp + c q s q p q ) \ < 6, 

where c q > is the same as in Theorem X. Then the problem (4-1) has a unique 
solution u* G X. 

Proof. Define a mapping F : X — > X by 

(4.6) F(u) = J™(u-pAu), ueX. 
It follows from (3.3) and (4.6) that 

(4.7) ||F(«) - F(v)\\ = \\Jff{u pAu) J™(v pAv)\\ 

< -z\\u- v - p(Au - Av)\\. 
o 

Since X is g-uniformly smooth, we have 

(4.8) \\u- v- p(Au- Av)\\ q < \\u-v\\ q - qp(Au - Av, j g (u - v)) 

+ c qP q \\Au - Av\\ q 
< (1 -qrp+c q s q p q )\\u- v\\ q . 

Combining (4.7) and (4.8), then we have 

r 

s { 

It follows from (4.5) that F is a contractive mapping and so there exists a unique 
point u* G X such that 

u* = J™{u* - pAu). 

Therefore, by Lemma 4.1, u* G X is a unique solution of the problem (4.1). This 
completes the proof. 

Remark 4.2. If X is 2-uniformly smooth and there exists p > such that 

(a a\ i r i ^ \/r 2 r 2 -c 2 s 2 {t 2 -8 2 ) 2 2 22 2 

(4.9) \p o < 2 ' rT > C 2 S ( T - (5 ) J 

CiS C2TS Z 

then (4.5) holds. We note that all Hilbert spaces and L p (or l q ) spaces, 2 < q < oo, 
are 2-uniformly smooth. 

Algorithm 4.1. Let r\ : X x X — > X* , A : X — > X be two single-valued 
mappings and M n : X — ► 2 X be a generalized m-accretive multivalued mapping for 
each i = 1, 2, 3 • • • . For any given uq G X , define an iterative sequence {u n } by 

/ 41f) \ \ u n+1 = (1- a n )u n + a n jM"(v n - pAv n ), 

\ V n = (1 - fln)u n + /3 n jM"(u n - pAu n ), 



\\F(u) - F(v)\\ < -{l-qrp + c q s q p q )S\\u-v\\ 
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where {a n } and {f3 n } be two sequences in [0, 1] satisfying the following 

oo 

^a„ = +oo. 

Theorem 4.2. Let X, n, A and M be the same as in Theorem 4.1. Let M n : 
X — > 2 X &e a generalized m-accretive multivalued mapping for each i — 1, 2, 3, • • • . 
Assume that there exists p > such that, for all z <G X. 

lim ||J p M "(z)-J p M (z)||=0 

and the condition (4.5) holds. Then the iterative sequence {u n } generated by Algo- 
rithm 4-1 converges strongly to the unique solution u* G X of the problem (4.1). 

Proof. Let u* <G X be the unique solution of the problem (4.1). It follows from 
Lemma 4.1 that 

(4.11) u* = (1 - a n )u* + a n Jp'(u* - pAu*) 

= (1 - /?„K + Pn Jp'{u* - pAu*). 
From (4.10) and (4.11), we have 

(4.12) \\u n+l -u*\\ 

< (1 -a n )||w„ - w*|| +a n \\jjf n (v n - pAv n ) - jjf(u* - pAu*)\\ 

< (1 - a n )\\un - u*\\ + a n \\jf"(v n - pAv n ) - Jf"(u* - pAu*)\\ 
+ a n \\J™ n (u* - pAu*) - Jf \u* - pAu*)\\ 

T 

< (1 -a n )\\u n -u || +a n -\\v n -u* - p(Av n - Au )\\ + a n g n , 

where 

g n = ||jf »(«* - pAu*) J™(u* pAu*)\\ ^ 0. 
Since X is g-uniformly smooth and A is strongly accretive and Lipschitz continuous, 
we have 

(4.13) \\v n -u*-p{Av n -Au*)\\ q 

< \\v n - u*\\ q - qp(Av n - Au*,j q (v n -u*)) +c q p q \\Av n - Au*\\ q 
<(l-qrp + c q s q p q )\\v n -u*\\ q . 

Substituting (4.13) into (4.12), then we obtain 

(4.14) \\u n+1 -U*\\ < (1 - a n )\\u n - u*\\ +a n h\\v n -U*\\ + a n g n , 
where 



h= -(1 - qrp + c q s q p q )i < 1. 



r 

Similarly, we can prove that 

(4.15) \\v n -u*\\ < (1 - P n )\\u n -u*\\ + P n h\\u n - u*\\ + a n g n . 
It follows from (4.14) and (4.15) that 

(4.16) \\u n+1 -u*\\ < (1 - a n )\\u n -u*\\ +a n h(l - [3 n )\\u n -u*\\ 

+ a n f3 n h 2 \\u n - u*\\ + c? n hg n + a n g n 
< (1 -a„(l - h))\\u„ -u*\\ +a n g„(a„h+ 1). 
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Letting 



ii *n + n h\ h (a n h + l) 

\u n -u , t n = (1 - h)a n , b n = g n —7z 7w- , c n = 0, 

(1-h) 



then (4.16) can be rewritten as 

O-n+l < (1 — tn)dn + b n t n + Cu- 
lt follows from Lemma 4.2 that a n — > as n — ► oo and so the sequence {u n } 
converges strongly to the unique solution u* G X of the problem (4.1). This 
completes the proof. 

Corollary 4.2. Let X be a 2-uniformly smooth Banach space and n, A, M, 
M n be the same as in Theorem 4-2. If the condition (4.9) holds, then the iterative 
sequence {u n } generated by Algorithm 4.1 converges strongly to the unique solution 
u* e X of the problem (4.1). 
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C 1 ' 1 functions and optimality conditions 

Davide La Torre* Matteo Rocca^ 



Abstract 

In this work we provide a characterization of C 1,1 functions on W 1 (that 
is, differentiable with locally Lipschitz partial derivatives) by means of second 
directional divided differences. In particular, we prove that the class of C 1 ' 1 
functions is equivalent to the class of functions with bounded second direc- 
tional divided differences. From this result we deduce a Taylor's formula for 
this class of functions and some optimality conditions. The characterizations 
and the optimality conditions proved by Riemann derivatives can be useful 
to write minimization algorithms; in fact, only the values of the function are 
required to compute second order conditions. 

Keywords: Divided differences, Riemann derivatives, C 1,1 functions, nonlinear 
optimization, generalized derivatives 

1 Introduction 

The study of the class of C 1 ' 1 functions has been renewed since the work of Hiriart- 
Urruty in his doctoral thesis [7]. The need for investigating these functions, as 
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pointed out in [8], [10], [23], [24] and [25], comes from the fact that several prob- 
lems of applied mathematics including variational inequalities, semi-infinite pro- 
gramming, iterated local minimization, etc. involve differentiable functions with no 
hope to be twice differentiable. In [8] the authors introduced the concept of general- 
ized Hessian matrices and derived second order optimality conditions for nonlinear 
constrained problems. Further applications can be found in [10], [15], [19], [20], 
[22]. 

In this section we recall some concepts which are fundamental for understanding 
the proof of the results. 

1.1 Riemann derivatives 

In the following we will consider a function / : Q — »■ R, with Q an open subset of 
M n . For such a function we define: 

8*f( X ; h) = f(x + 2hd) - 2f\x + hd) + f(x). 

with x e n,h eR and deW 1 . 

Definition 1.1. The second Riemann derivative of f at a point x G Q in the 
direction d el" is defined as: 

if this limit exists. 

Definition 1.2. The second upper and lower Riemann derivatives of f at x £ Q in 
the direction d G lR n are defined, respectively, as: 

-i", ^ r ^f(x;h) 
f r [x; d) = hm sup — , 

f"(x;d)=llmiTd 5 * f( ; X j h \ 

-r v ' ' h^o h 2 

Similarly we can define differences: 

A d J(x; h) = f{x + hd) - 2f(x) + f{x - hd), 

and then the corresponding second Riemann-type derivatives f'^ix; d), f R ix; d) and 
f R ix;d). 

For properties of Riemann derivatives one can see [1], [2], [6] and [16]. 
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Lemma 1.1. Assume that f is bounded in a neighborhood of the point xq G Q. If, 
for a fixed d G IR n , there exist neighborhoods U of the point x and V of G R 
such that 2 ^2' i s bounded on U x V\{0}, then also ^ x+ fe is bounded on 

U x V\{0}. 

Proof. From the hypotheses we obtain that there exists a number S > such that 
\/x £ U and V/i with |/i| < 5, /i 7^ 0, the following inequalities hold: 



f( x + hd ) - f{x) -2[ f [x + ^d) - f{x) 



<M 



/i 



/ ( a; + |d) - /(*) - 2 (7 (* + Jd ) - f{x) 



<M 



h 



f( x + ^d)-f(x)-2(f(x+^d) -f(x) 



4 
< M 



2" 



2 „-i j ^ 1 \ \ 2 T 

Multiplying these inequalities by 1, 2, 2 2 , . . . , 2*™ -1 ) respectively, we obtain by addi- 
tion: 



/(•'■ + /"/) J(.r) 2"[/(x+Ad) -/(,-) 



< 2M 



and hence: 



in /(^ + ^d)-/(x) 



/i 



<M' 



for |5 < \h\ < 5, by using the boundedness of /. Hence, writing £ = ^, we have: 



f(x + Cd)-f(x) 



£ 



A A 

< M' for < \e\ < — , n = 0, 1, . . . , 

— ' 2 ra+1 ~~ ~~ 2 n 



and the lemma is established, since n can be arbitrarily chosen. 



In the following we set: 

, f(x + hd)- f(x) f'(x + hd;d)- f'(x;d) 
t (x; d) = hm , f (x; d) = hm . 



n 



if these limits exist. 
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1.2 Standard mollifiers 

The function: 

{Cexp(p^ rT ), if \\x\\ < 1 
0, if ||x|| > 1 

is C°°(R n ) and we can choose the constant CgR such that: 

4>(x)dx = 1. 
Definition 1.3. Let e > 0. The following functions: 

are called standard mollifiers. 

Definition 1.4. Let f :Q^R. We say that f e C£(Q) if f e C k (Q) and 



sptf = {x G n : f(x) ^ 0} C ft. 
Theorem 1.1. [3] The functions (p £ are C°°(R n ) and satisfy: 

• J Rn (p £ (x)dx = 1 

• spt^ C B(0,e) = {iGl" : ||x|| < e}. 

For a bounded function / : ft — ► R, and 5 > 0, we define functions f £ : R n — *■ R by 
the convolution f £ (x) = f n (f) £ (y—x)f(y)dy. Observe that f £ (x) = if x <£ ft+I?(0, e) 
and that f £ G C°°(R n ). 

Theorem 1.2. [3] Suppose that f G L\ oc (Vt). Then f £ (x) — ► /(x) o.e. rr G ft, w/jen 
£ — > 0. If f E C(ft) then the convergence is uniform on compact subsets o/ft. 

Theorem 1.3. [3] Let K be a compact subset of ft. Then 3eq > such that\/e < Eq 
and \fx G K , the following function: 

y -»■ <t>e{y-x) 

isC™(Q). 
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2 The main results 

Definition 2.1. A function f : Q — ► R is locally Lipschitz at xq when there exist a 
constant K and a neighborhood U of Xq such that: 

\f(x)-f(y)\<K\\x-y\\, Vx,yeU. 

Definition 2.2. A function f : Q — > R is of class C 1,1 at xq when its first order 
partial derivatives exist in a neighborhood of xq and are locally Lipschitz at xq. 

Some possible applications of C 1,1 functions are shown in the following examples. 

Example 2.1. Let g : Q C R n — *■ R be twice continuously differ entiable on Q and 
consider 1 f(x) = [g + (x)] 2 where g + (x) = max{g(x), 0}. Then f is C 1,1 on fi. 

Example 2.2. In many problems in engineering applications and control theory 
([23], [24] and the references therein) one has to study nonsmooth semi-infinite 
optimization problems such as the following: 

minimize f{x) 

subject to max <f)j(x, t) < 0, j = 1 . . . / 

t£[a,6] 

where f : R n ^ R and <f)j : lR ra — *■ R are C 2 , j = I . . .1, -00 < a < x < b < +00. 
One approach for solving this problem is to convert the functional constraints into 
equality constraints of the form: 

hj(x) = / [m&x{(j)j(x, y), 0}] dt = 0,j = l...l 

J a 

and apply the methods of nonlinear programming. Hence the problem becomes: 

minimize f(x) 

subject to hj(x) = 0, j = 1 . . . / 
Since 4>j is C 2 , it is easy see that the function hj is C 1 ' 1 with the gradient: 

f b 

Vhj(x) = 2 / max{0j(:r, £), O}V0j(x, t)dt, j = 1 . . .1. 

J a 



lr This type of functions arises in some penalty methods. 
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Example 2.3. Consider the following minimization problem: 

min/ (x) 

over all x G K n such that f\(x) < 0, . . . f m (x) < 0. Letting r denote a positive 
parameter, the augmented Lagrangian L r [21] is defined on W 1 x W 11 as 



L r (x, y) = f (x) + 1 J2i[Vi + 2rf l (x)] + } 2 - y\. 



From the general theory of duality which yields L r as a particular Lagrangian, we 
know that L r (x, •) is concave and also that L r (-, y) is convex whenever the minimiza- 
tion problem is a convex minimization problem. Upon setting y = in the previous 
expression, we observe that: 

m 

L r (x,0) = f (x)+rY,[fHx)} 2 
i=i 

is the ordinary penalized version of the minimization problem. L r is differentiable 
everywhere on M. n x R m with: 

m 

W x L r (x,y) = V/ (x) + Y}Vj + 2r/,(a:)] + V/,(a:) 

When the f\ are C 2 on W 1 , L r is C 1,1 on IR ra+m . The dual problem corresponding to 
L r is by definition: 

max g r (y) 

over y e M. m , where g r {y) = mf xe ^n L r (x,y). In the convex case with r > 0, g r is 
again C 1,1 concave function with the following uniform Lipschitz property on Vg, 

\Vg r (y)-Vg r (x)\ < ^\y-y'\, Vy,y' e R m . 

The following result characterizes a function of class C 1 ' 1 by the boundness of 
second-order divided differences. 

Theorem 2.1. Assume that the function f : $1 — *■ R is bounded on a neighborhood of 
the point xq G Vt. Then f is of class C 1 ' 1 at xq if and only if there exist neighborhoods 
U of x andVofOeR such that 5if £' h) is bounded on U x V\{0}, Vd G S 1 = 
{deR n : \\d\\ = 1}. 
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Proof, i) Sufficiency. ^From lemma 1.1, since 2 ^' is bounded on U X y\{0}, 
\/d G S 1 , the same holds for — — J~ ■ Observe that this last fact implies that / 
is locally Lipschitz at Xq and hence continuous in a neighborhood of x . For every 
x in a neighborhood of xq and for e "sufficiently small", we have, for d G S 1 : 

f £ (x;d) = \im f ^ X + hd) ' f ^ x) - 



lim . 

/i->0 h 



h^O h 

1 " 



<t>e{y-x-hd)f(y)dy- I <j) e (y - x)f(y)dy 



Putting z = y — hd, we obtain: 

4>e(y ~ x - hd)f(y)dy = / (f> E (z - x)f(z + hd)dz. 

n Jn-{hd} 

From theorem 1.3, we know that, for e "sufficiently small", the functions z 
<j) e (z — x) are C£°(Q) and hence, if also \h\ is "small enough", we get: 



(f) £ (z — x)f(z + hd)dz = I (f) £ (z — x)f(z + hd)dz. 

Q-{hd} Jq 



It follows that: 



fix- d) = lim f tt Z + hd ) f ^ <t> £ (z - x)dz. 
h^Jn h 



Furthermore one can easily see that: 

f £ (x + 2hd)-2f £ (x + hd) + f £ (x) 
f e (x; d) = Inn 

and similarly deduce: 

f'J(x; d) = lim f 6d2f( f 2 ' h) Mz - x)dz. 

From the boundedness of 2 -£ ' and of x+ fe , we obtain the existence of a 

constant M such that \f' £ (x;d)\ < M and \f' £ '(x;d)\ < M, for every d G S* 1 , i in a 
neighborhood U ot x$ and e "sufficiently small". Hence, for every x G U and d £ S l , 
there exists a sequence e n converging to such that f' £ (x; d) converges to a limit 
which we denote by a(x; d). Observe that a(x; d) is bounded on U whenever d G S 1 . 
For every x G U, d G S 1 and h with \h\ "small enough", we can write: 

f £n (x + hd) = f £n ( X ) + hf £n (x-, d) + l -h 2 f: n n n] d), 
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where £ n G (x,x + hd). 

Recalling theorem 1.2, taking the limit for n — ► +00, it follows that f" {£, n ]d) 

converges to a limit which we denote by /3(x, h, d). Moreover: 

fix + hd) = f(x) + ha(x; d) + -h 2 {3(x, h, d). 

Observing that /3(x,h,d) is bounded for x G U, \h\ "sufficiently small" and d G S 1 , 

it follows that a(x;d) = f'(x;d). 

Furthermore, Vrf G S 1 the functions f'J (x;d) are bounded on U uniformly with 

respect to e and thus the functions f' e (x; d) satisfy the following uniform Lipschitz 

condition: 

\fL(y'' d ) - fL( x 'i d )\ ^ B \\v - x\\,Vx,y e u. 

Since f' e {y\ d) and f' e (x; d) converge to f'(y; d) and f'(x; d) respectively, we see that 
f'(x; d) is Lipschitz on U, Vg? G M n . Taking d = e l , i = 1, . . . , n (where e l is the i-th 
fundamental vector of R n ), we obtain the thesis, 
ii) Necessity. Assume that / is of class C 1 ' 1 at Xq. Set: 

A d 2 f(x; s, t) = f{x + sd + td) - fix + td) - f\x + sd) + fix), 

where d G S 1 , x G Q, s,t G R and |s| and |t| are "sufficiently small". Applying the 
mean value theorem, we obtain: 

A d 2 fix; s,t) _ < V/(x + 6>ici + sd) - V/(x + 9td), d > 

st s 

where 6 G (0, 1). Since / is of class C 1,1 at xq it follows easily that there exist a 
constant M, a neighborhood [/ of x$ and a number 5 > such that, Vd G S* 1 we 
have: 



A 2 /(a;;M) 



< M, VxG C/, |s| < 5, |t| < <5. 



si 

Now the thesis follows observing that if s = t — h, then A2/(x; s, t) = S^fix; h). 



n 

Corollary 2.1. Assume that the function f is bounded on a neighborhood of Xq G f2. 
Then f is of class C 1 ' 1 at x if and only if there exist neighborhoods U of Xq and V 
0/O6I such that A *Kf' h) iS bounded on U x V\{0}, W G S l . 
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Proof. The proof is straightforward remembering that: 

5 d 2 f(x;h) = A d 2 f(x + hd;h). 

□ 

Corollary 2.2. If f is of class C 1,1 at xq, there exist sequences e n converging to 
and C, n G (^Oj^o + hd) such that f'J n (^ n ;d) converges to a limit (3(xo,h,d) and it 
holds: 

/(*„ + hd) = f(x ) + f'(x ; d)h + ^° 2 M V . 

Proof. It is enclosed in the proof of the previous theorem. □ 

Theorem 2.2. (Taylor's formula) Let f be a function of class C 1 ' 1 at xq. 

(i) If the function x — > f r (x; d) is upper semicontinuous in a neighborhood of Xq, for 

a fixed d E S 1 , then there exists £ G [xq, x + hd] such that, for h "small enough" 

we have: 

h 2 —n 
f(x + hd) < f(x ) + hf(x ; d) + yf r {£\ d) 

(ii) If the function x — > f"(x; d) is lower semicontinuous in a neighborhood of x , for 
a fixed d G S 1 , then there exists £ G [x , x + hd] such that for h "small enough" 
we have: 

f(x + hd) > f(x ) + hf'(x ; d) + ^£'(£; d) 

Proof, i) Without loss of generality, the term (3(xq; h; d) in the previous corollary 
2.2 can be expressed as: 

P(x ;h;d)= lim /"(£„; d) 

for some sequences £ n — ► £ G [xoj^o + ^d] an d e n — ► 0. Similarly to the proof of 
theorem 2.1, one can write that 2 : 



f'J n (Z n ,d)= / <j>l(y-Z n ;d)f(y)dy 
Jb(o,d h ^° h 2 



'B(0,1) 

2 In the proof of this theorem we will use the following generalized version of Fatou's lemma: if 
f n is a sequence of measurable functions, f n < M and E C K™ is a subset of finite measure, then 
limsup„^ +00 f E f n < f E limsup rw+00 /„ 
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h ^°JB(0,l) h 

/ lim sup n — (f)(y)dy= /"(£„ + e n y;d)(f)(y)dy. 

Jb{0,1) h-+0 II JB(0,1) 

Now using the upper semicontinuity of / r (-; d) we have: 

/3(x; h;d) < / lim sup j" r (£ n + e„y; d)(f)(y)dy < 

JB(0,1) n->+oc 

J"(&d)(f)(y)dy = j' r (£;d) 

'B(0,1) 

and the proof is complete. 

ii) It is similar to the previous proof and we omit it. C 

Theorem 2.3. Assume that f is continuous and f"(x;d) exists on a neighborhood 
of the point xq , Vrf G S 1 . Then f is of class C 1,1 at x if and only if there exist a 
neighborhood U of xq and a function g G L^iJJ) such that the following assumptions 
hold: 

(i) 3M > such that |/;'(x; d)\ < M, Vx G U, VdeS 1 , 
(ii) 2 ^2 ' < g(x), for \h\ "small enough" (h ^ 0), d G S 1 and a.e. x G U. 

Proof, i) Sufficiency. Arguing in a fashion similar to that of theorem 2.1 and using 
Lebesgue theorem, we obtain for e "sufficiently small", for every a; in a neighborhood 
of Xo and d G S 1 : 

f £ (x; d) = lim / — (p e (z - x)dz = 

lim — m £ {z — x)dz = 

/ f"(z;d)(/) £ (z -x)dz. 
Jq 

It follows that \/d G S 1 f' e '(x,d) is bounded on U (uniformly with respect to e). 
Using the integral representation of divided differences (see for instance [9], ch. 6, 
th. 2), we have: 

5 d J £ (x;h) n 



h 2 



/ dti / f"(x + t 2 hd + tihd;d)dt 2 - 
Jo Jo 
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For x and h in suitable neighborhoods of xq and respectively, the left member in 
the previous inequality is bounded by a constant M (uniformly with respect to e). 
Sending e to and recalling theorem 1.2, we get the existence of neighborhoods U 
of x and V of G R such that Vd G S 1 s * f $' ,h) is bounded on U x V\{0}. The 
thesis now follows recalling theorem 2.1. 

ii) Necessity. The proof is similar to that of the necessary condition in theorem 
2.1. C 

Remark 2.1. Hypothesis (ii) in the previous theorem cannot be omitted. In fact, 
as is easily seen, the function f(x) = \x\ satisfies hypothesis (i) but not hypothesis 
(ii) in a neighborhood of Xq = and is not of class C 1,1 at 0. 

Remark 2.2. Theorems 2.1 and 2.3 extend the elementary condition which relates 
the Lipschitz condition on f and the boundedness of f" '. We generalize this relation 
without requiring any differentiability hypothesis and linking the existence and the 
Lipschitz behaviour of f to the boundedness of 2 ^ ' or of the directional Riemann 
derivatives. 

Remark 2.3. Conditions similar to those of theorem 2.3, expressed in terms of 
ffi(x; d) can be proved in analogous way. 

3 Optimality conditions for unconstrained opti- 
mization problems 

The aim of this section is to study necessary and sufficient conditions for C 1,1 un- 
constrained optimization problems. These conditions are proved by using the gen- 
eralized Taylor's expansions given in the previous section and Riemann derivatives. 
These optimality conditions can be used to write minimization algorithms; in fact 
for the computation of the next results only the values of the function are required. 
In the following we will suppose that, for any d G S 1 , the function x — > f r (x, d) is 
upper semicontinuous and that the function x — > f"(x; d) is lower semicontinuous 
in a neighborhood of Xq. 

So we consider the following unconstrained optimization problem: 
PI) min/(x) 

where Q is an open subset of lR n . 
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Theorem 3.1. (necessary condition) If f is C 1 ' 1 and xq is a local minimum point 
then V/(x ) = and J^(x , d) > Vd G S 1 . 

Proof. /,From Taylor's formula we obtain, for h "small enough": 

h 2 — 
f(x + hd) < f(x ) + h< V/(xo), d > +— f r (£, d) 

where £ G [xo, £o + /id]. So 

0<f(x o + hd)-f(x o )<Jl(Z,d) 

and taking the limit when /i — > we obtain the thesis. □ 

Theorem 3.2. (sufficient condition) If f is C 1 ' 1 , V/(xo) = and f f (xo+otd, d) > ; 

Va G (0, 1) and Vd E S 1 , then xq is a strict local minimum of f on Q. 

Proof. On the contrary suppose that Xq is not a strict local minimum; then there 
exists a sequence Xk such that Xk — » Xo, when fc — > +oo and /(xfc) < /(xo) VA; G N. 
So Xk = xo + 5kUk, where ||wfc|| = 1 and 5k — *• when A; — ► +oo. So we have: 

f(x k )>f(x ) + SlS^f^ 
where xq < £& < x + 5 k Uk. This implies that 

and then, when /c is "sufficiently large", we obtain: 

which contradicts the hypothesis. □ 

References 

[1] Ash J.M.: Very generalized Riemann derivatives. Real Analysis Exhange, 12, 
1985, 10-29. 

[2] Ash J.M.: Generalizations of the Riemann derivative., Tran. Amer. Math. Soc, 
126, 1997, 181-199. 



.OPTIMALITY CONDITIONS 53 



[3] 

[4] 
[5] 

[6] 
[7 



[9 



H. Brezis: Analyse fonctionelle- Theorie et applications. Masson Editeur, Paris, 
1963. 

X. Chen: Convergence of BFGS method for LC l convex constrained optimiza- 
tion. SIAM J. Control Optim., 34, 1996, 2051-2063. 

R. Cominetti, R. Correa: A generalized second-order derivative in nonsmooth 
optimization. SIAM J. Control Optim., 28, 1990, 789-809. 

A. Guerraggio, M. Rocca: Derivate dirette di Riemann e di Peano. Convessita e 
Calcolo Parallelo, Verona, 1997. 

J.B. Hiriart-Hurruty: Contributions a la programmation mathematique: deter- 
ministe et stocastique. (Doctoral thesis), Univ. Clermont-Ferrand, 1977. 

J.B. Hiriart-Urruty, J.J. Strodiot , V. Hien Nguyen: Generalized Hessian matrix 
and second order optimality conditions for problems with C 1,1 data. Appl. Math. 
Optim., 11, 1984, 43-56. 

E. Isaacson, B.H. Keller: Analysis of numerical methods. Wiley, New York, 1966. 



[10] D. Klatte, K. Tammer: On the second order sufficient conditions for C 1,1 - 
optimization problems., Optimization, 19, 1988, 169-180. 

[11] D. Klatte: Upper Lipschitz behavior of solutions to perturbed C 1,1 programs. 
Math. Program. (Ser. B), 88, 2000, 285-311. 

[12] D. La Torre, M. Rocca: C k ' 1 functions and Riemann derivatives. Real Analysis 
Exchange, 25, 1999-2000,743-752. 

[13] D. La Torre, M. Rocca: Higher order smoothness conditions and differentiabil- 
ity of real functions. Real Analysis Exchange, 26, 2000-01, 657-667. 

[14] D. La Torre: Characterizations of C 1 ' 1 functions, Taylor's formulae and opti- 
mality conditions. (Doctoral Thesis), Department of Mathematics, University of 
Milan, 2001. 

[15] D.T. Luc: Taylor's formula for C k ' 1 functions. SIAM J. Optimization, 5, 1995, 
659-669. 



54 D. LA TORRE, M.ROCCA 



[16] Marcinkiewicz J., Zygmund A.: On the differentiability of functions and sum- 
mability of trigonometrical series. Fund. Math., 26, 1936, 1-43. 

[17] P. Michel, J. P. Penot: Second-order moderate derivatives. Nonlin. Anal., 22, 
1994, 809-824. 

[18] L. Qi, W. Sun: A nonsmooth version of Newton's method. Math. Program., 
58, 1993, 353-367. 

[19] L. Qi: Superlinearly convergent approximate Newton methods for LC 1 opti- 
mization problems. Math. Program., 64, 1994, 277-294. 

[20] L. Qi: LC 1 functions and LC 1 optimization. Operations Research and its ap- 
plications (D.Z. Du, X.S. Zhang and K. Cheng eds.), World Publishing, Beijing, 
1996, 4-13. 

[21] T.D. Rockafellar: Proximal subgradients, marginal values, and augmented la- 
grangians in nonconvex optimization. Mathematics of Operations Research, 6, 
1981, 427-437. 

[22] W. Sun, R.J.B. de Sampaio, J. Yuan: Two algorithms for LC 1 unconstrained 
optimization. J. Comput. Math., 18, 2000, 621-632. 

[23] X.Q. Yang, V. Jeyakumar: Generalized second-order directional derivatives and 
optimization with C 1 ' 1 functions. Optimization, 26, 1992, 165-185. 

[24] X.Q. Yang: Second-order conditions in C 1,1 optimization with applications. 
Numer. Funct. Anal, and Optimiz., 14, 1993, 621-632. 

[25] X.Q. Yang: Generalized second-order derivatives and optimality conditions. 
Nonlin. Anal., 23, 1994, 767-784. 



Numerical analysis of a quasistatic sliding contact 

problem with wear 

Jose R. Fernandez ( x ), Mircea Sofonea 1 - 2 ^ and Juan M. Viario 1 - 1 ) 

(1) Departamento de Matematica Aplicada, Universidade de Santiago de Compostela, 

Campus Sur s/n, Facultade de Matematicas. 15782 Santiago de Compostela, Spain 

E-mail: jramon@usc.es, maviano@usc.es 

(2) Laboratoire de Mathematiques et Physique pour les Systemes (MEPS), 

Universite de Perpignan via Domitia, 

52 Avenue Paul Alduy, 66860 Perpignan, France, 

E-mail: sofonea@univ-perp.fr 

Abstract 

We consider a mathematical model which describes the sliding contact with 
wear between a viscoelastic body and a rigid moving foundation. The process 
is quasistatic, the contact is bilateral and the wear is modeled with a version 
of Archard's law. The variational formulation of the problem consists of a 
system of nonlinear evolutionary equations which has a unique solution under 
certain assumptions on the given data. We study the numerical approach to the 
problem using a fully discrete finite elements scheme with implicit discretization 
in time and we derive error estimates for the approximative solutions. Finally, 
we present numerical results in the study of one and two dimmensional test 
problems. 

Keywords: Viscoelasticity, contact, Archard's law, wear, finite elements. 



1 . Introduction 

Wear is one of the plagues which reduce the lifetime of modern machine elements. It 
represents the unwanted removal of materials from surfaces of contacting bodies oc- 
curing in relative motion. Wear arises when a hard rough surface slides against a softer 
surface, digs into it, and its asperities plough a series of grooves. Generally, a math- 
ematical theory of friction and wear should be a generalization of experimental facts 
and it must be in agreement with the laws of thermodynamics of irreversible processes. 
A general model of quasistatic frictional contact with wear between deformable bod- 
ies was derived [11], [12] from thermodynamic considerations. This model was used 
in various papers (see for example [9, 10]), where existence and uniqueness results of 
weak solutions have been proved. 
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56 The present paper is devoted to the study of the numerical analysis of a problem of 
sliding contact with wear. The process is quasistatic, and the contact is bilateral. 
We model the process as in [12] by introducing the wear function which measures 
the wear of the contact surface and which satisfies Archard's law. The variational 
analysis of the model was provided in [2]. Here we study the numerical approach to 
the problem using spatially semi-discrete and fully discrete finite elements schemes 
with implicit discretization in time. Notice that from physical point of view our 
model is quite limited and, since the processes involved in wear of contact surfaces 
are very complicated, there is a need to consider in the future more sophisticated 
mathematical models. However, we believe that the rigorous numerical analysis in 
the context of this simplified physical model may be helpful and represents a first 
step in the study of more realistic models describing the contact with wear. 

The paper is organized as follows. In Section 2 we introduce some notations and 
preliminaries. In Section 3 we present the sliding contact problem and derive the 
variational formulation. Then, we recall a version of the result obtained in [2] con- 
cerning the unique solvability of the model. In Section 4 we consider the fully discrete 
approximation of the model, by using finite elements for the spatial variable and the 
backward Euler scheme for the time variable. We state the existence and uniqueness 
of the fully discrete solution and we derive error estimates which prove its conver- 
gence to the solution of the continuous problem. Finally, in Section 5 we present some 
numerical results for one and two dimensional test examples. 



2 . Notation and preliminaries 

In this short section we present the notation we shall use and some preliminary 
material. For further details we refer the reader to [3, 6, 8]. 

We denote by S^ the space of second order symmetric tensors on IR d (d = 1,2,3), 
while " • " and | • | will represent the inner product and the Euclidean norm on E>d 



and H d , respectively, i.e. 








U ■ V = UiVi, 


\v\ 


= (v.v) 2 


Vw, v eJR' 


(J ' T Oij T{j , 


T 


= {t-t)* 


V<r, t G ! 



id- 
Here and below the indices i and j run between 1 and d, and the summation conven- 
tion over repeated indices is adopted. 

Let f2 C IR be a bounded domain with a Lipschitz boundary T and let v denote the 
unit outer normal on T. We shall use the notation 

H = [L 2 (tt)] d = {u= ( Ui ) ; Ul G L 2 (Sl) }, Q = { a- = (a %j ) ; a %j = a 3i G L 2 (fi) }, 
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H l = {u= (ui) ; e{u) EQ}, Qi = {cr eQ ; Diver E H}. 

Here e : i?i — ► Q and Div : Q\ — > H are the deformation and the divergence 
operators, respectively, defined by 
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1 



e(u) = (£ij(u)), £ij{u) = -(u iyj + u J7 i), Div cr 



cr 



1-3,3)1 



where the index that follows a comma indicates a partial derivative with respect to 
the corresponding component of the independent variable. The spaces H, Q, Hi and 
Qi are real Hilbert spaces endowed with the cannonical inner products given by 



(u,v)h= / UiVidx, (o-,t)q= / (JijTijdx, 

Jq Jn 

(u,v) Hl = (u,v) H + (£(u),e{v)) Q , (<t,t)q 1 = {ct,t) q + (Div cr, Div t) h . 

The associated norms on these spaces are denoted by | • \h, | • \q, \ • \h x and | ■ \q 1 , 
respectively. 

Let Hy = [// 5 (r)] d and let 7 : Hi — > Hr be the trace map. For every element 
v E Hi we still write v to denote the trace '-/v of v on T and we denote by v v and v T 
the normal and the tangential components of v on the boundary T given by 

v u = v ■ u, v T = v — v v v. (2.1) 

Let H T be the dual of H-p and let (•, •) denote the duality pairing between H v and 
Hy- For every a E Qi there exists an element cru E H T such that 

((T,e(v)) Q + (Div a, v) H = (01/, 7«) Vv E Hi. (2.2) 

Moreover, if cr is a regular function, then 

(01/, jv) — avvda Mv E Hi. (2.3) 



i" 



We also denote by o~ v and cr T the normal and tangential traces of cr and we recall 
that, when cr is a regular (say C 1 ) function, then 

a u = (ens) is, cr r = ens — a v is. (2.4) 



Finally, for every real Hilbert space X, we use the classical notation for the space 
L°°(0,T;X) and we denote by C([0,T];X) and C^QO.T^X), T > 0, the space of 
continuous and continuously differentiable functions from [0,T] to X, respectively. 
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3 . The model of sliding contact with wear 

In this section we describe the model for the process, present its variational formula- 
tion and state an existence and uniqueness result, Theorem 3.1. 

The physical setting is as follows. A viscoelastic body occupies in its reference confi- 
guration a domain Q with outer Lipschitz surface T that is divided into three disjoint 
measurable parts r\, 1^ and 1^, such that measTi > 0. Let [0,T] be the time 
interval of interest, where T > 0. The body is clamped on Ti x (0, T) and therefore 
the displacement field vanishes there. A volume force of density fo acts in Q x (0, T) 
and surface tractions of density f 2 act on T 2 x (0, T). On T 3 the body is in bilateral 
contact with a moving rigid foundation, which results in the wear of the contacting 
surface. We assume that there is only sliding contact, which is always maintained. 

We denote by u the displacement field, a the stress field and s(u) the small strain 
tensor. We assume that the constitutive law of the material is of the form 

(T = Ae(u) + ge(u), (3.1) 

where A and Q are given nonlinear functions which will be described below. In (3.1) 
and below, in order to simplify the notation, we usually do not indicate explicitely the 
dependence of various functions on the variables x e Q U T and t G [0, T\. Moreover, 
the dot above represents the derivative with respect to the time variable, i.e. 

du 

u = ——. 
dt 

We recall that in linear viscoelasticity, the stress tensor er = (a^) is given by 

Cjj = ciijki£ki{u) + gijki£ki{u), (3.2) 

where A = {aijki) is the viscosity tensor and Q = (gijki) is the elasticity tensor, 
for i,j,k,l = l,...,d. Kelvin- Voigt viscoelastic materials of the form (3.1) involving 
nonlinear constitutive functions have been considered recently in [9, 10]. 

Next we describe the conditions on the contact surface r% in which our interest lies. 
We denote by v* the velocity of the foundation and we assume that the tangential 
displacement on the contact surface vanishes, i.e. 

u T = 0. 

We introduce the wear function w : T 3 x [0,T] — ► 1R, which measures the wear of 
the surface, using the model derived in [11, 12]. The wear is identified as an increase 
in gap in the normal direction between the body and the foundation or, equivalently, 
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as the depth of the material that is lost. Since the body is in bilateral contact with 
the foundation it follows that 

w = —u v + Uq v (3.3) 

where u represents the initial displacement field. Thus, the position of the contact 
evolves with the wear. We remark that w(0) = i.e. at the initial moment the 
material is new. Moreover, the effect of the wear is the recession of T^ and therefore 
it is expected that u v < uq v , which implies w > 0. We conclude that the wear is 
non-negative which justifies the sign convention in (3.3). 

The evolution of the wear of the contacting surface is governed by a version of Ar- 
chard's law (see [11, 12]), which states that the rate of the wear is proportional to 
the contact stress and the slip, thus 

w = -k w a u v*, (3.4) 

where k w > is a wear coefficient and v* — \v*\ > 0. 

We can now eliminate the unknown function w from our problem. Let (3 = l/(k w v*). 
Using (3.3) and (3.4), we have 

a v = fiu v . (3.5) 

Moreover, the wear increases in time, i.e. w > and therefore, from (3.3), it follows 
that ii v < 0. Thus, the contact condition (3.5) implies 

cr u = —f3\ii v \. 

Finally, we assume that the process is quasistatic, i.e. the inertial term can be ne- 
glected in the equation of motion. 

With these assumptions, the classical formulation of the quasistatic problem of sliding 
contact with wear is the following one. 

Problem P. Find a displacement field u : Vt x [0,T] — ► IR and a stress field 
a : n x [0, T] — ► E> d such that 

<r = Ae{u) + Qe{u) infix (0,T), (3.6) 

Div<r + / = infix(0,T), (3.7) 

u = onTi x (0,T), (3.8) 

ois = f 2 on r 2 x(0,T), (3.9) 

(T v = -P\u v \, u T = onr 3 x(0,T), (3.10) 

u(0) = u in fi. (3.11) 
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60 To obtain the variational formulation of Problem P, we need additional notation. To 
this end, let V denote the closed subspace of Hi, defined by 

V = {v E Hi ; v = onTi, v T = on T 3 }. 

Since measTi > 0, Korn's inequality holds: there exists Ck > which depends only 
on Q and Ti such that 



e(v)\a > C K \v 



\Q 



K\V\Hi 



VveV. 



(3.12) 



A proof of Korn's inequality may be found in [7, p. 79]. On V, we consider the inner 
product given by 

(u, v) v = (e(u), e(v)) Q Vtt, v G V, 



and let I • \v be the associated norm, i.e. 



\v\v = \e(v) 



\/veV. 



(3.13) 



It follows from (3.12) and (3.13) that \ ■ \h-i_ and \ ■ \v are equivalent norms on V and 
therefore (V, | • |y) is a real Hilbert space. Moreover, by the Sobolev's trace theorem 
and (3.12) we have a constant Cq depending only on the domain Q, Fi and T 3 such 
that 

\v\ [L 2 {r:i)]d < C \v\ v VveV. (3.14) 



In the study of the frictional problems with wear, we assume that the viscosity and 
elasticity operators satisfy 



(b) There exists C A > such that 

\A(x, ei) - A(x, e 2 )\ < C A \e 1 - e 2 | 
Vei,£ 2 £ §d) a - e - x e Q. 

(c) There exists m A > such that 

(A(x, ei) - A(x, e 2 )) ■ (ei - e 2 ) > m„4 |ei - e 2 | 2 
Vei,e 2 ^ Sd> a - e - sgO. 

(d) For any e G S^, as i — »■ *4(x, e) is Lebesgue measurable on Q. 

(e) The mapping a; i— ► A(x, 0) G Q- 



> (3.15) 



(b) There exists an £g > such that 

|<?(aj, ei) - Qix, e 2 )| < £<? |ei - e 2 | 
Vei,e 2 G §d, a.e. a?Gf2. 

(c) For any e G § d , x i— > Q{x, e) is Lebesgue measurable on Q. 

(d) The mapping x \— ► £/(cc, 0) G Q- 



> (3.16) 
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We note that assumptions (3.15) and (3.16) are satisfied in the case of the linear 61 
viscoelastic model (3.2) provided that a^u G L°°(Q), g^i G L°°(Q) and there exists 
a > such that 

aijki{x) ikii>a ||| 2 V£ G S d , a.e. a; G Q. 

We also assume that the body forces and tractions have the regularity 

f eC([0,T];H), f 2 eC([0,T];[L\T 2 )] d ), (3.17) 

and the function /3 is such that 

(3 G L°°(r 3 ) and there exists (3* such that f3(x) > /3* > 0, a.e. a? G T 3 . (3.18) 
Finally, the initial displacement satisfies 

u G V. (3.19) 

Next, we define the element f(t) G V given by 

(/(t),«)v = (/o(t),v)H + (/2(t),t;) [ ^(r 2 )]- VveV, tG[0,T], (3.20) 

and let j : V x V — > 1R be the functional 

j(u,v) = / /3|-u^|f^(ia Vm, d G F. (3-21) 

Jr :i 

Keeping in mind (3.17) and (3.18) we note that the integrals in (3.20) and (3.21) are 
well defined. Moreover, using (3.20) it follows that 

feC([0,T];V). (3.22) 

Using (2.1)-(2.3) and (2.4), it is straightforward to show that if {u, cr} represents a 
regular solution to Problem P then 

(cr(t),e(v)) Q +j(u(t),v) = (f(t),v) v VveV, te[0,T}. (3.23) 

To conclude, from (3.6), (3.11) and (3.23) we obtain the following variational formu- 
lation of the mechanical problem P. 

Problem PV. Find a displacement field u : [0, T] — > V and a stress field cr : [0, T] — > 
Q such that 

cr(t) = Ae(u(t)) + Qe(u(t)) Vt G [0, T], (3.24) 

(<r(t),e(v)) Q +j(u(t),v) = (f(t),v) v VveV, tG[0,T], (3.25) 

u(0) = u . (3.26) 
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62 The well-posedness of this problem is given by the following result. 

Theorem 3.1. Assume that (3.15)-(3.19) hold. Then, there exists (3q > which 
depends only on Q, r 1; T 3 and A such that Problem PV has a unique solution {u, <x} 

if 

|^U~(r 3 ) < A). (3.27) 

Moreover, the solution satisfies 

ueC\[0,T];V), <reC([0,T];Q 1 ). 

The proof of Theorem 3.1 is carried out in [2] in several steps, based on classical results 
for time-dependent nonlinear equations with strongly monotone operators followed by 
a fixed point argument. In the rest of the paper, we assume the conditions stated in 
Theorem 3.1 are satisfied so that contact problem PV has a unique solution. 

We end this section with the remark that if v* is large enough then /3 = l/(k w v*) is 
small enough and therefore condition (3.27) which guarantees the unique solvability 
of Problem PV is satisfied. We conclude that the mechanical problem (3.6)-(3.11) 
has a unique weak solution if the velocity of the foundation is large enough. 



4 Fully discrete approximation 

In this section we consider a fully discrete approximation of Problem PV. Let V h C V 
and Q h C Q denote finite dimensional spaces used to approximate the spaces V and Q 
satisfying e(V h ) C Q h . Here h > is a discretization parameter. Let Vqh : Q — *■ Q h 
be the orthogonal projection operator defined through the relation 

(V Qh q,q h ) Q = (q,q h ) Q V q E Q, Vq h E Q h . 

The orthogonal projection operator is non-expansive, i.e. 

\VQhq\ Q < \q\ Q WqEQ. 

This property will be used on various occasions. 

In addition, we need a partition of the time interval [0, T] : = t <ti<...<t^ = T. 
We denote the step size k n = t n — t„_i for n = 1,2, ... ,N and let k = max k n be 

n 

the maximal step size. For a sequence {w n }^ =0 , we denote Aw n = w n — u>„_i for the 
difference and 5w n = Aw n /k n the corresponding divided difference. In this section 
no summation is considered over the repeated index n and c will denote positive 
constants which are independent of the parameters of discretization h and k. 
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The fully discrete approximation method is based on the backward Euler scheme. It 63 
has the following form. 

Problem PV hk . Find u hk = {wf }n=o C V h and a hk = {<xf }^ =1 C Q h such that 

a h n h = V Qh Ae(6u h n k ) + V Qh Qe{u hk ), (4.1) 

(a h n k ,e(w h )) Q +j(5u h n k ,w h ) = (f n ,w h ) v Vw h eV\ (4.2) 

< = <■ (4-3) 

Using similar arguments to those employed in [2], we have the following. 

Theorem 4.1. Assume that (3.15)-(3.19) and (3.27) hold. Then Problem PV hk has 
a unique solution. 

Remark 4.2. In practice, the fixed point algorithm used to show the existence and 
uniqueness of solution for Problem PV hk is directly applied. To solve the semilinear 
equality obtained, a penalty-duality algorithm introduced in [4] is suggested. Details 
are given in Section 5. 

We now derive error estimates for the fully discrete solution. Let n e {1,2, ...,7V} 
and let er n = er(t n ), v n = u(t n ), u n = u(t n ), v 1 ^ = 5u^ k - By integration of (3.24) we 
get 

cr(t) = Ae(v(t)) + Qe{ f v{s)ds + u ) Vt e [0, T], (4.4) 

Jo 

where v(t) = ii(t). Substracting now (4.4) at t — t n to (4.1), we obtain 
a n - erf = (/ - V Qh )Ae(v n ) + V Qh (Ae(v n ) - Ae(v h n k )) 

+(/ - V Qh )ge{u n ) + V Qh [Ge{ / v(s)ds + u ) - g e (V kjvf + v%)], 

where / is the identity operator on Q. 
Thus, 

\cr n - erf | Q < \(I - V Qh )Ae(v n )\ Q + c\v n - v h n k \v + |«o - u h Q \ v 

+ \(I-V Qh )ge(u n )\ Q + \ v{s)ds-y^k j vf\ v . (4.5) 

Jo j=1 

Plugging (4.4) into (3.25) at time t = t n , (4.1) into (4.2) and substracting the obtained 
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64 equalities, we deduce that 

(Ae(v n ) - Ae(v™), e(w h )) Q + j(v n , w h ) - j(v h n k , w h ) = 

-{Ge{ v(s)ds + u )-ge(y2k J v ! ; k + u h ),e(w h )) Q Vw h eV h . 

Therefore 

(Ae(v n ) - Ae(v%°), e(y n ) - e(v™)) Q + j(v n , v n - vf) - j(v^ k , v n - v* k ) 
= (Ae(v n ) - Ae(v™), e(v n ) - e(w h )) Q + j(v n , v n - w h ) - j(v™, v - w h ) 

rt n n 

+{Ge( / v(s)ds + wo) - Ge(J2 Mf + «£), ^K - w h ))q 

-(Ge( / v(s)ds + « ) - ge( V kjvf + u h ), e(v n - v h n k )) Q \/w h e V h . 
Jo j=1 

It can be verified that 

(As(v n ) - Ae(v™), e(v n ) - e(v h n k )) Q +j(v n , v n - vf) - j(v^ k , v n - v hk ) 

>(m A -C 2 (3 )\v n -v h n k \ 2 v , 
where /3q is chosen in such a way that m A — Cqj3q > 0, and Cq is given by (3.14). 

After some algebra, we then obtain the inequality 

n 

\v n -v h n k \ v <c{\v n -w h \ v + I n + Y,k 3 e^ Vw h eV h , (4.6) 

3=1 

where 

rt n n 

I n — I / v(s)ds - y^kjVjlv, ej = \Vj - v^ k \ v . 
Jo j=1 

The following lemma was proved in [5] . 

Lemma 4.3. Assume {g n }n=i an d { e n}n=i are t wo sequences of non-negative num- 
bers satisfying 

n 
e n S Cg n + C y KjCj-i 
3=1 

then 

max e n < max q n . 

l<n<N l<n<N 

Moreover, for the quadrature error I n we have 

In -^2 \ v ( s ) - v j\vds < ct n k\v\ L <x>{o,T;V) (4.7) 
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if v G L°°(0,T;V). Applying Lemma 4.3 to (4.6) we obtain the following error 65 
estimate: 



max \v n — v^ k \ v < c( klvlTocfoT.v) + max inf \v n — w 

Kn<N \ K ' ' ' Kn<N w h GV h 



h 

'V 

w h &v 



Using now the definitions of v n and v„ we find 



\u n - u hk \ v < c{I n + max \vj - V% k \ v + |tt - V%\ v ). (4.8) 

l<j<N 

Therefore, from (4.5), (4.7) and (4.8) we obtain the following result. 

Theorem 4.4. Let {u,ct} e C 1 ([Q,1 i \;V) x C([0,T];Qi) be the solution of Problem 
PV and {(u^ k , cr^ k )}^ =0 C V h x Q h be the solution of fully discrete problem PV hk . 
Assume conditions (3.15)-(3.19) and (3.27) hold. Let us suppose u e L°°(0,T;V). 
Then we have the following error estimate: 

max {\u n - u hk \ v + \<r n - trf\ Q } < c(\u - u% 

\<n<N \ 

J rk\u\L 00 (QT-y\ + max inf \ix n — w h \v 

l<n<N w h eV h 

+ ™n<N {liI " v Q h )Mun)\ Q + |(/- V Qh )Qe{u n )\ Q }). 
As a consequence of Theorem 4.4 we obtain the following result. 

Corollary 4.5. Assume that conditions (3.15)-(3.19) and (3.27) hold. Then under 
the additional assumptions 

\u -u%\ v <ch, inf \w - w h \ v < ch Vw e V n [H 2 (tt)] d , 
w h ev h 

\(I-V Qh ){r)\ Q <ch VteQ, uEL™(0,T;{H 2 (n)] d ), 

we obtain the following error estimate: 

max {\u n - u hk \ v + \(T n - erf | Q } < c(h + k). 

Kn<N 



5 Numerical results 

In order to show the perfomance of the numerical method described in the above 
section, some numerical experiments have been done in the study of one and two 
dimensional test problems. In this section, we describe the complete algorithm to solve 
Problem PV hk and we resume some numerical results which exhibit the perfomance 
of the algorithm. 
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66 5.1 Algorithm to solve Problem PV hk 

Sustituting (4.1) into (4.2), we obtain for n — 1, 2, . . . , N: 

(P Qh [Ae(5u h n k ) + ge(u^%e(w h )) Q +j(5u h n k ,w h ) = (f n ,w h ) v \fw h G V\ (5.1) 

where u^ k = Uq is given. 

Using the properties of the orthogonal operator Pqh defined in Section 4 and defining 
again v^ k = Su 1 ^, we obtain that (5.1) is equivalent to: 

(Ae(v h n k ) + Qe[u hk _^ + k n v h n k ),e(w h )) Q + j(v h n k ,w h ) = (f n ,w h ) v \Iw h e V h . (5.2) 

Then, the element u^ k is obtained by 

u>n=< k -i + k n v™, 71 = 1,2,..., N. 

In order to solve problem (5.2), the first step is to apply the fixed point argument 
introduced in [2] to prove the existence and uniqueness of solution. 

For r]Q k G Q h given, let {(v<nhk, T]^: k )} r >o C V h x Q h be the sequence obtained solving 
problems (5.3)-(5.4) described below: 

v n hk G V h , ] 

lr } (5.3) 

(Ae(v r]hrk ) + ri k ,e(w h )) Q +3(v r)hrk ,w h ) = (f n ,w h ) v Vw h eV\ J 

Vr+i = ^(«f-i + knVrjhk). (5.4) 

We have (see [2]), 

lim v n hk = v^ k . 



To solve the semilinear equality (5.3) we used a penalty-duality algorithm introduced 
in [4]. In order to describe it, we remark that 



j(u,v) = / (5\u v \v v da — I f3(u„ — u v )v v da — I (3u„v u da— / (3u u v u da, 

where u+ = max{-u„, 0} and u~ = mm{u v , 0}. 
Let E h C L 2 (T 3 ) and K h C E h be defined by 

E h = {q h e L 2 (r 3 ) ; q\ c G P^C) VCe9 h }, 
K h = {q h G E h ; gf 1 < in C VC G 9 h }, 
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where 6 h is the induced triangulation of T h on T%. 
Let B : V h — > E h be the operator 

Bv h = v h v \Jv h e V h , 
and let B* : E h — ► V ft be the operator given by 

3q h v h v da \/v h e V h , Vg h G £ h . 
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(BW)v 



r 3 



It is well known that problem (5.3) is equivalent to minimize on V h the functional 
J : V h -»• R defined by 

j(^) = /(</)-<?(£ A 

where / : \^ ft — ► (—00, 00] and (7 : -E ft — > (—00, 00] have the following form, 

f(v h ) = \{Ae{v% e(v h )) Q + \ J (3[(v h u ) + ] 2 da - (/„, v h ) v - (r,f , e(v h )) Q , 



9(q h ) 



pwyyda. 



According to [4], the following iterative algorithm is considered: 

Given p$ k £ E h and A > 0, let {(v^f ,p" )} m >o C V h x E h be the sequence obtained 
solving the following problems: 



B* P hk e df(v 



m+lJi 



p h m k +1 = g' x (Bv h n k +l + \p h J:), 



(5.5) 
(5.6) 



where df denotes the subdifferential operator of f and g' x is the Yosida 's approxima- 
tion of the maximal monotone operator dg. Then (see [4]): 

lim v™ = v^hk. 

rn — >m »r 



We can see that problem (5.5)-(5.6) is equivalent to the following. 

(Ae(v h m k +1 ),s(v h -v h m k +1 )) Q + 1 -^ PH^da- 1 -^ P[(v h m k +1 )t} 2 da 

> (/„, V h - V hk +l )y - (n h r k , S(v h - < fe +1 ))Q 



.h r- \rh 



+ / X K - (<Vi),) da Vv* e V 
'r 3 



) (5-7) 



hk 
fm+1 



[Bv h m k +1 + \p h m k )-. 



(5.8) 



NUMERICAL ANALYSIS OF A QUASISTATIC SLIDING... 



68 Finally, problem (5.7) is similar to the normal compliance contact problem in elasticity 
(see [13]). To solve it, we used the following penalty-duality algorithm previously 
introduced in [14]: 

Given qft k G E h and w G R, w > 0, let {(vf fc , q^ k )}i>o C V h x E h be the sequence 
obtained solving the following problems: 

v hk e yh 



(Ae(v? k ),e(w h )) Q +u f (5{vf) v w h v da = (f n ,w h ) v 

Jr 3 

-(rjM,e(w h )) Q + [ Pp h m k w h u da- [ f3q? k w h u da Vw h eV h , 



} (5-9) 



~hk = 2(v hk 



hk 

Hl+l 



I )u 

uJ 



U> 



: ( h 



lik 



(5.10) 



[(l-co)q^-2P Kh (Qi% 



where P K h is the orthogonal L 2 (r 3 )-projection operator over the convex set K h . We 
obtain (see [14]): 

lim vf = v h m k +1 . 



5.2 A one-dimensional test-problem 

We consider a viscoelastic rod Q = (0, L) which is fixed at its left end x = and is 
subjected to the action of a body force of density fo(x, t) in the x-direction (see Figure 
1). Its right end x = L is in contact with a rigid moving obstacle which produces the 
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Figure 1: Test 1: Sliding contact of a viscoelastic rod. 

wear of the rod. This problem corresponds to Problem P with Q = (0, L), Ti 
T 2 = 0, T 3 = {L}. We use a linear viscoelastic constitutive law, i.e. 

a = ae(u) + ge{u). 



{0}, 
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Here e{u) = — , while a and g are material constants independants on x and t, a > 0. 69 
A complete description of this problem is the following one. 

Problem T1D. Find a displacement field u : [0, L] x [0, T] — »■ K. and a stress field 
a : [0, L] x [0, T] -»• R snc/i that: 

d 2 u(x,t) du(x,t) , mN 

a(x,t) = a^_Li + ^^^ in(0,L)x(0,T), (5.11) 

,>T(! n =/o(x,«) in(0,L)x(0,T), (5.12) 



dx 

u(0,t) = for te (0,T), (5.13) 

a(l,t) = -/3|^M| fortG(0,T), (5.14) 

u{x,0) = u (x) in (0,L). (5.15) 



For computation we have used the following data: 

L = lm, T = 10s., a = 100 N -s/m, g = lN/m, 

fo(x, t) = 99Ae- 1 \/x e (0, 1), te [0, 10], 

= 10~ 4 TV • s/m, u (x)=A^- + x Vx6(0,l), 
9899.99 
~ ~ 9899.995' 

The exact solution of the above problem is 

u(x, t) = (A 3 - + x)e~\ (5.16) 

a(x,t) = -99(Ax + l\e- t . (5.17) 

We used a discretization by continuous piecewise affine functions for the space V h 
and by piecewise constant functions for the space Q h . Since £{V h ) C Q h , in (4.1) no 
projection is needed. 

In Table 1 we provide the exact error values for several discretization parameters h 
and k. We note that the asymptotic behavior 

max (\u n - u™\v + \(T„ - erf | Q ) < c(h + k), 

Kn<N 
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ki\h- 


0.1 


0.05 


0.025 


0.01 


0.005 


0.0025 


0.001 


0.1 


0.026276 


0.013411 


0.007236 


0.004078 


0.003392 


0.003198 


0.003142 


0.05 


0.027474 


0.013749 


0.006910 


0.002862 


0.001593 


0.001061 


0.000855 


0.025 


0.028167 


0.014079 


0.007041 


0.002822 


0.001422 


0.000703 


0.000349 


0.01 


0.028594 


0.014291 


0.007145 


0.002858 


0.014210 


0.000715 


0.000287 


0.005 


0.028738 


0.014363 


0.007181 


0.002872 


0.001436 


0.000718 


0.000287 


0.0025 


0.028809 


0.014399 


0.007199 


0.002879 


0.001439 


0.000719 


0.000287 


0.001 


0.028853 


0.014421 


0.007209 


0.002883 


0.001442 


0.000721 


0.000288 



Table 1: Problem T1D: Exact error values. 



derived in Corollary 4.5, is obtained with the constant c p» 0.13874157, independent 
of h and k (see Figure 2 (left-hand side)). The evolution of the wear, obtained from 
(3.3), is shown on the right-hand side of Figure 2. For computations we used here 
the values k = h = 0.01 . 



Error estimate for |U-Uh 



Evolution of the wear function through the time 




Figure 2: Problem T1D: Asymptotic constant of error and evolution of the wear function through 
the time. 



5.3 A two-dimensional test problem 

As a two-dimensional example of Problem P, we consider the plane stress viscoelas- 
ticity problem for an isotropic material shown in Figure 3. The physical setting 
corresponds to a three-dimensional linear viscoelastic body of cross-section Q = 
(0, 6) x (0, 6) submitted to the action of vertical forces in such a way that a plane stress 
hypothesis is assumed. A linearly decreasing traction force is supposed to act on the 
part T 2 = [0, 6] x {6} of the boundary and no body forces act in Q. The horizontal 
displacements on the lateral surface are supposed to vanish, i.e. Ti = {0, 6} x [0, 6]. Fi- 
nally, the body is in sliding contact with a rigid moving foundation on T3 = [0, 6] x {0}. 
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Figure 3: Test 2: Contact of a 2D viscoelastic body. 
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A permanent contact with the foundation is maintained and the wear is then pro- 
duced. We assume the wear coefficient to be constant on r% and therefore the data 
j3 is constant on T 3 too. 



The elasticity tensor Q is then defined by: 
{Gr) a /3 



Ek . E 

: 5- (Til + T 2 2)d a (3 + — — r a(3 , 1 < a, (3 < 2, 

I — K z 1 + K 



where E is the Young's modulus, k the Poisson's ratio of the material and S a /3 denotes 
the Kronecker symbol. The viscosity tensor A has a similar form, i.e. 



(Ar) al3 = n(Tu + T 22 )S a p + T]T ( 



a/3, 



1 <a,P< 2, 



where \x and rj are viscosity constants. Recall also that the von Mises norm for a 
plane stress field r = {r a p) is given by 



i 

2 2 i o 2 \ 2 

T ll + T 22 - r ll r 22 + OT 12 



For computation we have used the following data: 

T = ls, f = 0N/m 3 , f 2 (m, x 2 , t) = (0, -10(6 - Xl )t)N/m 2 , 

cr = A^/m 2 , u = 0m, (3= 1CT 2 iV • s/m 2 , 

E = 100N/m 2 , k = 0.3, \i = 32.967iV • s/m 2 , rj = 23.077iV • s/m 2 . 
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72 As in the one dimensional test problem, we used continuous piecewise affine functions 
to approximate the space V and piecewise constant functions to approximate the 
space Q. Again, e(y h ) C Q h and therefore no projections are needed in the discrete 
constitutive law (4.1). Moreover, for the time discretization, we used a uniform time 
step with k = 0.01. 

The amplified deformed mesh, the von Mises norm of stress at final time T = 1 s 
and the initial boundary are shown in Figure 4. We used again (3.3) to obtain the 
wear function. In Figure 5 we show the wear of the contact boundary for several time 
values as well as the evolution of the wear of the central contact node x = (3, 0). 




Wasted material 



49.32 
47.09 



Figure 4: Amplified (by 60) deformed mesh and von Mises stress norm at final time T = Is and 
initial boundary. 



Evolution of the wear function amplified by 60 



Amplified evolution of the wear of the central contact node 



1=0.2 

- t=0.4 
* t=0.6 

- - t=0.8 




Figure 5: Wear of the contact boundary for several time values amplified by 60 and evolution of 
the wear of the central contact node. 
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Abstract 

For each integer n > 2, we construct a subset S n of R and characterize 
all wavelets of L 2 (R) whose Fourier transform is supported in S„. This 
result generalizes the characterization of a class of wavelets given by E. 
Hernandez and G. Weiss. Wavelets are usually constructed by using a 
multiresolution analysis (MRA); but there are wavelets which cannot be 
constructed through an MRA. We show that the wavelets associated with 
S n are non-MRA wavelets, if n > 3. The dimension functions of these 
wavelets are also computed explicitly. 
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1 Introduction 

A function tp G L 2 (M) is said to be a wavelet if the system of functions {ipj,k '■ 
j, fceZ} forms an orthonormal basis for L 2 (M), where 

tp j>k (x) = 2 j / 2 tp(Vx-k), j,kez. 

A function tp <G L 2 (M) is a wavelet of L 2 (M) if and only if it satisfies the 
following four conditions: 



^|^(2 J '£)| 2 = 1 fora.e. £ e R. (1) 

jez 

^ tp(2 j £)tp(2i(£ + 2m7r)) = for a.c. £ G R and for all m G 2Z + 1. (2) 



j>0 



^]|^(C + 2fc7r)| 2 = 1 fora.e. (el. (3) 



A-t- 



^ V>(£ + 2k-n)xp{23 '(£ + 2kn)) = for a.e. £ € R and for all j > 1. (4) 

fcez 

We use the following definition of the Fourier transform 



/(0 = / /(^e-^cte, £ G 



The fact that equations (l)-(4) characterize all wavelets of L 2 (M) was ob- 
served by many authors. Lemarie and Meyer [7] obtained these equations for 
compactly supported wavelets. Bonami, Soria and Weiss [1] proved this fact un- 
der the assumption that tp is band-limited, i.e., -0 is compactly supported. For a 
function ip to be a wavelet, it is necessary that \\1pW2 = 1- With this assumption 
on ip, the following theorem was proved independently by G. Gripenberg [2] and 
X. Wang [9] (see also [3]). 

Theorem 1. Let tp G L 2 (R) with \\ip\\ 2 = 1. T/ien %p is a wavelet of L 2 (R) if 
and only ifip satisfies (1) and (2). 

For a proof of this theorem we refer to Chapter 7 of [4]. It was observed 
in [1] that the orthonormality of the system {ipj t k '■ 3, k € Z} is characterized by 
(3) and (4), and completeness by (1) and (2) (see also [4]). A consequence of 
Theorem 1 is that equations (1) and (2), along with the assumption \\1pW2 = 1, 
imply the equations (3) and (4). 

If tp is a wavelet, then the support of tp must have measure at least 27r. This 
minimal measure is achieved if and only if \tp\ is the characteristic function of 
some measurable subset K of R. Such a wavelet is called a minimally supported 
frequency (MSF) wavelet and the associated set K is called a wavelet set. We 
refer to [4] for proofs of the above statements. 

A function is said to be band-limited if its Fourier transform has com- 
pact support. The simplest example of a band-limited wavelet is the Shan- 
non wavelet whose Fourier transform is the characteristic function of the set 
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S = [— 2-7T, — 7r]U [7r, 2tt]. Lemarie and Meyer ([6], [8]) constructed a band-limited 
wavelet belonging to the Schwartz class. This wavelet satisfies ?/>(£) = e^/ 2 6(£), 
where b is an even, non-negative "bell-shaped" function and its support is equal 
to the set [— §7r, — |7r] U [|7r, |7r]. In [1] the authors characterized all wavelets 
ip such that \ip\ is an even continuous function whose support is equal to the 
set [— 2n — e' , —it + e] U [tt — e,2n + e'], where e, e' > and e + e' < n, so 
that this class includes the Lemarie-Meyer wavelet as a particular case. Later, 
in [5] (see also Theorem 4.1, Chapter 3 of [4]), the restriction on ip (i.e., \ip\ 
is even and continuous) was removed and all wavelets with Fourier transform 
supported in [— §7r, — §71"] U [§7r, §7r] were characterized. (In fact, they charac- 
terized all wavelets with Fourier transform supported in the set [—fa, 4-7T — |a], 
< a < 7r. It was also shown that for such a wavelet ip, it is necessary that ip = 
a.c. on [— |a, 2tt — |a].) In this article, we generalize this characterization: we 
construct a set S n for each n > 2, so that 52 is the set [— §7r, — §7r] U [|7r, §7r], 
and characterize all wavelets whose Fourier transform is supported in S n . 

The article is organized as follows. In section 2 we construct a set S n for 
each n > 2, which is symmetric with respect to the origin and is a union of 
four intervals. In section 3 all wavelets ip with Fourier transform supported in 
S n are characterized, and a necessary and sufficient condition is given for l^ 
to be an even function. Next, in section 4 we show that none of these wavelets 
is associated with an MRA. For a wavelet to be associated with an MRA, it 
is necessary and sufficient that the corresponding dimension function is equal 
to 1 a.e. In section 5 we compute the dimension functions explicitly and show 
that each of these functions is even and assumes all integral values from to 
the maximum value it attains. This gives an alternative proof of the fact that 
these wavelets are not MRA-wavelets, and in addition, provides an example of 
a family of dimension functions. 



2 Construction of the set S n 

Let n > 2. Put 

®n 2 n — 1 ^"' n ®n 2^ — 1 ^"' 

_ 2 "- 1 (2"-2) , _„„ _ 2 2 "' 1 

C n 2^ — 1 ^"' n — ^ n — 2 n — 1 ^ ' 

Define S n = S„ U S~ , where S„ — [a n , b n ] U [c„, d n ] and S~ = — (S„). 

Remark 1. 1. If n = 2, then 02 = §7r, &2 = C2 = §7r and <i2 = §7r. So ^2 is 
the set associated with the Lemarie-Meyer wavelets. 

2. As n increases, the sets [a„,6„] move closer to whereas [c n ,rf„] move 
farther away. Also observe that the measure of the set S n is 2{ ^ n _ 1 iT + 
2n _ 1 7r) which approaches 37r as n tends to infinity. 

All terms in the equations (l)-(4) that characterize all wavelets of L 2 (R) 
involve dilations by powers of 2 and translations by integral multiples of 2ir. In 
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the following lemma, for various subsets of S n , we identify those translates and 
dilates which are possibly in S n . 

Lemma 1. For n > 3, let S n be as above, and let e n — |s~y7r. 

(i) If £, G [a„, e n ], then £ + 2kn e S n iff k = 0, and 2 J £ £ S n iff j = 0. 

(ii) IfS, £ [e„,6„], thenS,+2kn £ 5„ iff fc = 0,-1, and2 j £ £ S„ ijffj = 0,n-l. 

(in) If$£ [c n ,d n ], then £ + 2/ctt £ S n iff k = 0,-2"- 1 , anrf 2^ e S 1 ™ iff 
j = 0,-(n-l). 

(w,) If £ E [—e n , —a n ], then £ + 2kir <G S^ iff fc = 0, and 2 J £ € S n iff j = 0. 

M If £ £ [-&„, -e„], i/ien £_ + 2kn e S n iff k = 0, 1, and 2^ G 5„ z/f 
j =0,n-l. 

(wj //Ce [-d„,-c„], tfien £ + 2fc7r e S„ iff fc = 0.2"- 1 , and 2^ e S„ iff 
j = 0,-(n-l). 

Observe that 

[e„,6 n ] - 27r= [-&„,-e n ], 2 n ~ 1 [e n ,b n ] = [c„,d„], (5) 

and 

2"- 1 ([e„,M -2tt) =2 n - 1 [-6„,-e„] - M„,-c„]. (6) 

Proof. Let £ e [a„,e„]. If 1 < fc < 2"~ 2 - 1 then £ + 2fc?r G [b n ,c n ], and 
if fc > 2" -2 then £ + 2fc7r > d n . In either case, £ + 2fc7r ^ supp ^i>. Similarly, 
if -2"~ 2 < fc < -1, then £ + 2/br £ [-c n ,-b„], and if fc < -2™~ 2 - 1, then 
£, + 2kir < —d n . Thus, we get £ + 2fc*7r £ S n if and only if fc = 0. 

Now, let £ £ [e n ,b n ]. In this case, if 1 < fc < 2™~ 2 - 1 then £ + 2fc?r £ 
[b n ,Cn], and £ + 2fc?r > d„ if fc > 2"" 2 . Also, if -2™~ 2 < fc < -2, then 
£ + 2fc?r £ [-c n , -b n ], and if fc < -2™~ 2 - 1, then £ + 2fc?r < -d n . This leaves 
fc = — 1, in which case £ + 2fc7r £ [—b n ,—e n ] which is a subset of S n . Thus, 
£ + 2fc7r £ S n if and only if fc = 0, — 1. 

Finally, let £ £ [c n ,d n ]. If fc < 1 then £ + 2/br > d„. If -2™- 2 + 1 < 
fc < -1, then £ + 2fc?r £ [b n ,C n ]. Now, £ + 2kir £ [-a n ,a n ] if fc = 2™~ 2 , and 
£ + 2fc7r £ [-Cn, -b n ] if -2"- 1 + 1 < fc < -2™- 2 - 1. Also, £ + 2fc?r £ [-d„, -c n ] 
if fc = -2™" 1 , and if fc < -2™- 1 - 1 then £ + 2fc?r < -d„. Thus, in this case, 
£ + 2fc?r e 5„ if and only if fc = 0, -2™~ 2 . 

We have proved the statements for the translations in (i)-(iii) of the lemma. 
The proof for the dilations is similar, and (iv)-(vi) follow by the symmetry of 
the set S n . □ 

3 The main result 

The following theorem characterizes all wavelets ip of L 2 (R) such that the sup- 
port of i/j is contained in the set S n . The case n = 2 is Theorem 4.1 of Chapter 3 
in [4] in which case the wavelets are MRA- wavelets. 
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Theorem 2. Let n > 2, tp e L 2 (R), supp $ C 5„ and 6(0 = 1^(01- ™- e?1 "0 
is a wavelet for L 2 (M) if and only if 



(i) 6(0 = 1 /or a.e. £ G [o„, e„] U [-e„, -a„], 
r»; & 2 (0 + fe 2 (2 n - 1 C) = 1 /^ a.e. £ G [e„, 6„], 
fmj 6 2 (0 + 6 2 (£ - 2tt) = 1 /or a.e. £ G [e„, 6„], 
(wj 6(C) = 6(2 n - 1 (e-27r)) /or a.e. £G[e„,6 n ], 
(uj V>(0 = e lB ^b(S,), where 9 satisfies 

6(0 + 9 (2"- 1 (C - 2tt)) - 9(i - 2tt) - 0{2 n ~ 1 i) = (2m(0 + 1) n, 
for some m(0 G Z, /or a.e. £ G [e„, 6„] n (supp 6) n ( 2 „Li Supp 6). 
Moreover, if n > 3, i/ien none of these wavelets is associated with an MRA. 

Remark 2. 1. It follows from Theorem 2 that |-0| is completely determined 
by its values on [e n , b n ]. On this set, let 6 = \ip\ be an arbitrary measurable 
function taking values between and 1. Using equations (5) and (6), \ip\ 
can be extented to other sets of S n with the help of properties (i)-(iv). 
Properties (ii), (iii) and (iv) can also be written as 

b2 (0+b 2 (^t) = 1 for a.e. t;£[c n ,d n ], (7) 

6 2 (0 + 6 2 (£ + 2tt) = 1 fora.e.^K,-e B ], (8) 

b(^ ZI £ + 2n\ = o(0 for a.e. [-d n ,-c n \. (9) 

2. If (supp 6) n ( „„ 1 _ 1 supp b) has an empty interior in [e n , b n ], then 9 can be 
chosen to be any measurable function. In particular, we can take 0(0 = 0. 

Proof of Theorem 2. First let us assume that ip is a wavelet for L 2 (R), 
supp i/j C S n and b = \ip\. So ip satisfies the equations (l)-(4). Consider 
equation (3): 

^6 2 (£ + 2/c7r) = 1 for a.e. £ e R. 

fcez 

We use Lemma 1 to pick out the non-zero terms in this sum. If |£| G [a„,e n ], 
then only k = will contribute to the sum. So we get 6(0 = 1 a.e., which is (i) 
of the theorem. For £ G [e„ , 6„] , we get non-zero contributions from fe = and 
fc = — 1. That is, 6 2 (0 + 6 2 (£ — 2-7r) = 1 for a.e. £, which proves (iii). Also if 
£ G [e„,6 n ], then 2™~ 1 £ G [c„,d„], and we get (non-zero terms now correspond 
to k = and fc = -2"- 1 ) 

ft2(2»- 1 ^) + 6 2 (2 n - 1 ^-2 n 7r) = l for a.e. £e [e„,6„]. (10) 
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This can also be written as 

b 2 (0 + b 2 (Z-2 n ir) = l fora.e. £e[c„,d n ]. (11) 

Now consider equation (1): 

^V(2 J £) = 1 fora.e. ^eR. 
jez, 

For £ G [e n ,f)„], the only possible non-zero terms correspond to j = and 
j = n — 1. So we conclude 6 2 (£) + 6 2 (2" -1 ^) = 1. This proves (ii). Combining 
(ii) and (10) we get condition (iv). 

It remains to prove condition (v) . For this purpose we consider equation (4) 
with j = n — 1: 



^i/>(C + 2fc7r)^(2"- 1 (e + 2fc7r)) = fora.e. (eR. 



(■t2 



If £ G [e„, b n ], the non-zero terms in the sum come from k = and —1. Hence, 



^(e)V(2"- 1 O + ^(e-2^)V(2"- 1 (C-27r))-0 fora.e. £e[e n ,6 n ]. (12) 
Using (iii), (10) and (12), we see that for almost every £ G [e n ,&„], the vectors 

(V>(0,^-2tt)) and (^^O.^""^ - 2tt))) 

are orthonormal in C 2 . If we let VK£) — e l6 *^&(£), then it follows that, for some 
real-valued measurable function a, 

= ^_ e -^(2— 1 (€-2-)) fo(2 r,-i (€ _ 2 TT)),e- t9( - 2n "^b(2 n - 1 0) (13) 
for a.e. £ G [e„,6 n ]. But from (ii), (iii) and (iv), we know that 
6(2 n " 1 = &(£ - 2ir 



b(0 = fe(2"- 1 (^-2^)) 
So (13) can be written as 



for a.e £ G [e„,6„]. 



(14) 



*(C) e ie(«) + e -^(2— 1 («-2 7r )) 



KO = o 



e »a(|) e »0(|-2ir) 



This shows that 



-ie(2"- 1 e) 



6(2"-^) 







for a.e. £ G [e„,6„]. 



-ia(S) = e i[fl(0+J(2» 1 («-2 7 r))+ 7r ] for ft g £ £ [^ ^ n supp & 



-«(Q 



o *[e(C-27r)+e(2"- 1 0] 



for a.e. £ € [e n ,b n ] n (^rsupp 6). 
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-d n J-b n 

d n r -e n r b 



Hence, for almost every £ G [e„, b n ] n supp b (1 ( 2 „ 1 _i Supp b), we have 
0(0 + 9 (2"- 1 (e - 2tt)) - 0(£ - 2tt) - 0(2"" 1 O = (2m(£) + 1) tt, 

for some integer- valued measurable function m. This proves (v). 

We now prove the converse. Suppose ip G L 2 (M), supp ^ C S„ and the 
function &(£) = |V>(£)I satisfies conditions (i)-(v) of the theorem. By Theorem 1, 
to show that ip is a wavelet, it is sufficient to show that HV'lb = 1 and ip satisfies 
(1) and (2). We have, 

Mm = \\ni= I m)\ 2 da 

+ £'jb 2 (£)dt 
+2(e„ - a n ). 

By changing variables £ -> 2™- 1 (£-2tt), £ -*■ 2" _1 £ and £ ^ £-2tt in the first, 
second and third integrals respectively, we get 

2tt|M|2 = 2"- 1 / " [6 2 (2"" 1 (^ - 2i,)) + 6 2 (2"- 1 0] ^ 

rb n 

+ / [fo 2 (£ - 2tt) + fe 2 (0] dC + 2(e„ - a„) 

= 2"- 1 (fe„ - e n ) + (b n - e„) + 2(e„ - a„) = 2tt, 

where we have used (10) and property (iii) of the theorem. Hence, ||"0||2 = 1- 
We will now show that ip satisfies (1). Let 

Suppose £ > 0. Observe that 



\j2 l [a n ,b n ] 

lez 

(j2 / ([a„,e„]U[e„,6„]). 



ie 



lez 

So there is an I G Z such that £ G 2'[a„,e„] U 2 l [e n ,b n ). If £ € 2'[a„,e n ] then 
since 2 -/ £ G [a„,e„], by Lemma 1 we have, 2 J (2~'£) ^ S n if j ^ 0. That is, 
2 J £ G" 5„ if j y^ —I. Hence, p(£) = 1, by (i). Similarly by using (ii), we can 
prove that /?(£) = 1 if £ G 2 ( [e n , 6„] . A similar decomposition for £ < proves 
that p(0 = 1 for a.e. (eM. 
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Finally, we have to show that ip satisfies (2). For m £ 2Z + 1, let us denote 
the function on the left hand side of (2) by t m (£). Then 



UO = YJ(2 i t)j>(2J{£ + 2mn)) 



j>0 

E 

j>0 



ip (2 J (£ + 2tott - 2m7r)) ip (2J(£ + 2tott)) 



= i_ ro (£ + 2m7r). (15) 

Therefore, we have only to show that £ m (£) = for a.e. £, if to € 2Z~ + 1 
(negative odd integers). Suppose to £ 2Z~ + 1 and m ^ — 1. Let £ <G K and 
suppose that 2 J '£ € 5„. Since j > 0, n > 2 and to ^ — 1 and is odd, therefore 
2J'to ^ 0,±1,±2"- 1 . So by Lemma 1, we have 2^ + 2-2-?m7r £ S n . This implies 
that each term of £ m (£) is zero, which proves that t m is zero. It now remains to 
show that £-i(£) = for a.e £. We have 



MO = X>(2W(2^-2-2*7r). 

i>o 

Let £ e R and suppose that 2 J £ e 5„. Then again by Lemma 1, 2- 7 £ — 2-2% ^ S n 
if — 2 J ^ 0, ±l,±2 n_1 . So the only possible j's to contribute a non-zero term 
are J = and j = n — 1. Thus, 



MO = V>(0^(£ - 2tt) + ^(2"- i C)^(2"- 1 e - 2 • 2«-i7r). (16) 

Now, both £ and £ — 27r belong to 5„ only if £ e [e„, &„]. Hence, the first term 
of (16) is zero unless £ £ [e„, &„]. Similarly, both 2™" 1 £ and 2 n ~ 1 £ - 2 • 2 n ~ x -K 
belong to S n only when 2™~ 1 £ G [c n ,d n ], which is equivalent to saying that 
£ £ [e„,6„]. That is, the second term of (16) is also zero unless £ £ [e„,6„]. 
Thus we get t_i(0 = if £ ^ [e„, &„]. Now on [e„, 6„], we have, by (14) 

MO = &(0K2 n " 1 0e i[ ' (0 "' K ~ 2,r)] +&(2"" 1 e)6(e)e 4[£ ' (2 "" C)+8 ( 2 "" ( ^ 2 " ) ) 1 . 

If £ ^ [e„, 6„] n supp 6 n ( 2^rsupp b), then cither &(£) = or b{2 n ~ 1 ^) = 0. 
So*_i(£) = 0. And if £ £ [e n , b n ]n supp b^^kr supp fe), then by (v), £_i(0 = 0. 
This completes the characterization. D 

There is another equation which characterizes all wavelets ip such that ip 
is supported in the set S n and the function b = \ip\ is even. For n = 2, the 
following proposition is proved in [4] (Proposition 4.7, Chapter 3). We observe 
that the result can be extended to the general case. 

Proposition 1. Suppose thatip is a wavelet ofL 2 (M.), b = \ip\, and supp b C S n . 
Then b is almost everywhere even if and only if 

b 2 (0 + b 2 (2n-0 = l for a.e. £e [e n ,b n ]. (17) 
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Proof. Let ip be a wavelet of L 2 (U.) such that b = \ip\ is supported in S n . 
Suppose that b is an even function and £ € [e„, b n ]. Since — £ G [— &„, — e n ], we 
have 

1 = 6 2 (-e) + 6 2 (-e + 27r), by (8) 

= & 2 (0 + fo 2 (27r - £), since & is even, 

which is (17). 

Conversely, suppose that (17) holds. Since by (i) of Theorem 2, b is even on 
[«„, e„], it is enough to show that b is even on the sets [e„, 6„] and [c„, rf n ]. 

(i) Let £ <G [e„,&„]. Therefore, — £ e [— &„,— e n ]. Then 

& 2 (-£) + & 2 (-£ + 27r) = l, by (8). 
This fact, together with (17), gives us 

6(0 = K-0- (is) 

(ii) Now let £ <E [c„,d„]. Therefore, ^W^ •= [ e n,M- F rom (18) we get 

<2^H(-2^M- (19) 

Since — 2 n.-i £ € [ — &m ~ e n], using (8) we obtain 

,2/ 1 A , ,2/ 1 



6%-^n^+6%-^n^ + 27rj=l. (20) 

Now, since — £ € [— d n ,— c n ], we get (using (9)) 

b {-^ + 2ir)=b(-0- (21) 

Substituting (19) and (21) in (20), we get 

b 2 (^)+b 2 (-t) = l- (22) 

Comparing (22) and (7) we get 6(£) = b(— £), which proves the proposition. □ 

4 The wavelets associated with 5 n , n > 3 are 
non-MRA 

A multiresolution analysis (MRA) is a sequence of closed subspaces {Vj : j e Z} 
of L 2 (M) satisfying the following properties: 

(i) Vj C V j+1 for all j e Z 

(ii) UjzzVj is dense in L 2 (M) and C\ i& Vj = {0} 



84 B.BEHERA.S.MADAN 



(iii) / e Vj if and only if /(2-) e V j+1 for all j e Z 

(iv) there exists a function ip G L 2 (JBL) such that {<^(- — k) : k £ Z} forms an 
orthonormal basis for Vq. 

A function tp that satisfies property (iv) is called a scaling function for the 
MRA. If ip is a scaling function for a given MRA, then it is easy to see that 
(see [4]) there exists a 27r-periodic function mo in L 2 (T), called the low-pass 
filter, such that 

0(2O=m o (O0(O- (23) 

The function mo satisfies the equation 

|m (0| 2 + K(£ + 7r)| 2 = l fora.c£eT. (24) 

Using (23) and (24), one can then construct a wavelet ip associated with the 
MRA. For any such wavelet the following equation holds (see [4] for details): 

l£(0l 2 = £hfaW- (25) 

We will now prove the last statement in Theorem 2. 

Proposition 2. If n> 3, i/ien i/ie wavelets characterized in Theorem 2 are noi 
associated with any MRA. 

Proof. Let ip be any wavelet such that ip is supported in S n and let it be 
associated with an MRA. Let ip be the corresponding scaling function and mo 
be the low-pass filter associated with if. Using (25) we can easily find that 

10(01 = j &(2 n -' -1 if \l;\£2 l [e n ,b n }, 0<l<n-2 (26) 

I otherwise. 

Hence, 

fl if |£| <^ 

1^(201 = | &(2 n -'0 if ie|e2'- 1 [e„,6„], 0</<n-2 (27) 

I otherwise. 

Case 1. 6(0 ^ 1 on [e„,6 n ], i.e., 6(0 ^ on [c„,d„]. 

If |£| < a n , then from (23) and (26), we have |0(2£)| = |m (OI ' 10(01 = 
l m o(OI- Therefore, by (27), we obtain |m (0l = 1 if |£| < ^f • Since m is 
27r-periodic, we have 

|m (0l = l on [-^,^]+2-2"- 3 7r. 

Note that [- 3 f, g f}+2- 2™- 3 tt =[f,<^f}= 2"- 2 [e„,6„]. Now, from (26), on 
2 n - 2 [e„,6„] we have \<p{£)\ = 6(20- So |0(2O| = |m (0l ■ 10(01 = b(2£). But 
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by (27), |£(2£)| = on 2 n - 2 [e n ,b n }. Therefore, 6(2^) = on 2"- 2 [e„,&„]. That 
is, 6(0 = on 2 n ~ 1 [e n ,6 n ] = [c n ,<i„], which is a contradiction. 

Case 2. 6(0 = 1 on [e„,6„]. 

As above, we have 



1 if |£| < a n 
lv(0|= <1 or if £e2 l [-b n ,-e n ], 0<l<n 
otherwise. 



(28) 



Therefore, 



1 if|C|<t 
|^(20| - { or if £ € 2 i - 1 [-6„, -e„], < / < 
otherwise. 



(29) 



Now |0(2£)| = |m (0l ' \<p(0\ = \mo(0\ if l£l < *n, by (28). Therefore, by 
(29), | mo (01 = 1 if |£| < ^f- Using the 27r-periodicity of mo, we get 



K(0l 



1 on 



q^ q^ 
2 ' 2 



] -2-2 



n— 3„ 



Note that [-^,^]-2-2 



n— 3„ 



] = 2™ [—fon, -e n ]. Hence, on the 



interval 2™- 2 [-6„, -e n ], we have |0(2f)| = |m (0| ■ 10(01 = K(0l = 1- But 
by (29), 1^(201 = on 2"~ 2 [— &„, — e„], which is again a contradiction. 

Therefore, for n > 3, the wavelets characterized in Theorem 2 are non-MRA 
wavelets. □ 



5 The dimension functions of wavelets associ- 
ated with S n 

Let tp be any wavelet of L 2 (M). Define Vj to be the closure of the span of 
{ipl.k ■l<3, ke Z}. If {Vj : j e Z} forms an MRA of L 2 (R), then we say that 
ip is associated with an MRA, or ip is an MRA- wave let. 

Not every wavelet of L 2 (R) is associated with an MRA. In the last sec- 
tion we proved that the wavelets associated with the set S n are non-MRA 
wavelets. The first example of a non-MRA wavelet, which is an MSF wavelet, 
was given by J.L. Journe. The associated wavelet set is J = [— ^y7r,— 47r] U 
[— 7r, — |7r] U [|i")7r] U [47r, ^7r] . Another interesting non-MRA wavelet is the 
Lemarie wavelet which is also an MSF wavelet with the corresponding wavelet 
set L = [— f 7Tj — |tt] U [|7r, f 7t] U [^7T, ^7r] . These two wavelets belong to the 
class of wavelets characterized in section 3 (see Remark 3). 

Given a wavelet ip of L 2 (R), there is an associated function D^, called the 
dimension function, defined by 

M) = EE^( 2J « +2 M)i 2 ' 

j>i kez, 
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It was proved independently by Gripcnberg and Wang that a wavelet ip is an 
MRA- wavelet if and only if D^ = 1 a.e. For a proof of this fact see [4]. 

In this section we compute the dimension functions for the wavelets charac- 
terized in Theorem 2. In addition to providing a family of dimension functions, 
this will give an alternative proof of the fact that these wavelets arc not associ- 
ated with any MRA. These dimension functions arc symmetric with respect to 
the origin and attain all integral values starting from to the maximum value. 
Also note that for a fixed n, all wavelets whose Fourier transform is supported 
in S n have the same dimension function D n . 

Theorem 3. Fix n > 3. Let ip be a wavelet such that supp ip C S n and let D n 
be the dimension function associated with ip. Then 

a.e. z/|£|e [O.g^Tr] 
, . a.e. tf|£|e [^tt,^^tt] (2<l<n-l) 

-^nlsj \ n ..|„ ro»-i 2 n — 2 1 r l 

a.e. if \t\ e |_2^TT7T, 2^r^T 7r J = [ a n,e n \ 
o.e. if\£\ £ [|e|7r,7r] = [e„,7r]. 

Proof. For I £ Z, we define 

on+/— 1 on+/— 1 <yl 

VI = 2 „ _ 1 7T = 2'a„ and ^ = — — - _^ — tt = 2'~ 1 e„. 

Note that p = a n , p\ — 2p Q = 6„, p n = d„, gi = e„ and g n = c„. Also 
observe that pi < qi+\ < pi+i for all I > 0. As Z) n is 27r-periodic, it is enough to 
compute its values for £ £ [— it, tt]. Since [— tt,tt] C [— Pi,Pi], we will compute 
D n for the interval [— pi,pi]- Note that 




(0,pi] - |j2- i [po,Pi] = U[p_j,p_j + i]. 



(>0 i>0 

Case 1. £ e [|^7r, jl^Tr] = [po.Pi]- 

An elementary calculation shows that if fc 7^ 0,-1, then 2 : '(£ + 2fc7r) lies 
outside the support of ip for all j > 1. Therefore, we have only to consider 
fc = 0,-1. We consider the intervals [poi^i] an d [giiPi] separately 

(a) If £ e [po,9i]j then 2 J £ e [6„,c„] for 1 < j < n - 1 and 2 J £ > d n for 
j > n. By a similar argument 2 J (£ — 27r) ^ supp -0 for all j > 1. So -D„(£) = 
for a.e. £ G [po,<7i]- 

(b) If £ € [<7i, pi], then it can be shown that 2 J (£ — 2tt) g' supp ip if j 7^ n— 1. 
Therefore, £>„(£) = |^(2™- 1 £)| 2 + \ijj (2™- 1 (C - 2vr)) | 2 = 1, by (11), as 2"" 1 £ € 

[9n,Pn] = [C n ,rf„]. 

We now proceed to compute D n on other sets. Observe that 



-1 
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+EEI^( 2 ^+ 2 ^))I 2 
*;=i j>i 

= D-(0 + D° n (0 + Dt(0, say. 

Let £ e [f^TT, ^=^7r] = [p_ l+1 ,p_j +2 ], / > 2. 

In the sum D~(£), the only pairs of (j,k) giving a possible non-zero term 
correspond to j = n — m — 2 and k — — 2 m , < m < n — 3. Thus, 

n— 3 

^7(0 = E 1^ (2^ m ^ 2 (e - 2 • 2"V)) | 2 . 
Similarly 

n— 3 

^(0 = E 1^ (2"- m - 2 (? + 2 • 2"V)) | 2 . 



m— 



Case 2. £ e [|>— I 7 "", 2 2 ^rr 7r ] = \p~i+\,P-i+i], I > n. 
Observe that in this case 

2"-™- 2 (£ + 2 • 2 m 7r) <= [c„, d„], for < m < n - 3. 

Therefore, 

|^(2"- m - 2 (£ + 2 • 2" l 7r)) I 2 + |i/>(2"-" l - 2 (C - 2 • 2 m 7r)) | 2 
= |^(2"- m - 2 (£ + 2 • 2" 1 tt)) I 2 + |i/>(2™-" l - 2 (C + 2 • 2 m 7r) - 2 n_1 7r) | 2 
= 1, by (11). 

Hence, we get D+(0 + D n (0 = « - 2. 

Now, \p_i +1 ,p_i +2 ] = [p-i+i,q-i+2\ U [g_j +2 ,p_j+2]. 

(a) If £ G [p-j+i,?-l+2], then 2'f £ supp & if j ^ J - 1. But 2'-^ G 
[po,«i] - [oi,e„]. Therefore, D°(£) - l^'" 1 ^ 2 = 1. 

(b) If £ e [«_i+2,P-i+2], then 2*f £ supp $ifj^/-l,n + J-2. Now 
2"+'- 2 £e [q n , Pn ] = [c„,rf„]. By (7) we get D° n (£) = |^(2"+'- 2 C)| 2 +|^(2 i - 1 0| 2 = 
1. 

Thus, in either case -D n (0 = n — 2+1 = n — 1. We have proved that 

D n (0=n-l,if£€ 



2 n-l 2 n- 



2"-!"' 2™-l 



Z > n. That is, 



A»(0 = n - 1 for a.c. £ e [0, 



Case 3. £ e [|^7r, ^^tt] , Z = 2. 
As in Case 2, it can be shown that 



! (£-2-2 m 7r) and 2 n - m ~ 2 (£ + 2 • 2 m ir) 
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are not in supp ip for all m such that < m < n — 3. Therefore, D n (£) = 

Now if £ £ [ ^"-l 71 "' 2 2"-i 1 ?r ] ' then 2-?£ ^ supp if j 7^ 1; and if £ belongs 
to the set [ 2 2 „_^ 1 7r, ^-i 71 "] , then 2 J £ ^ supp ip if j 7^ 1, n. In cither case, 

A,(0 = I>°(0 = I- 



-i+i 



Case 4. £ e [ 2"— l 71 "' 2 2"-i ^j > 3 < Z < n — 1. (This case is required if n > 4.) 



1 + p — i OTI + P 



We can write 2" m 2 £ e [ 2 2 „_ 1 7r, 2 „_ 1 7r] , where p = n — m — I — 1. Using 

the condition on support of 0, we can show that if £ G [ 2 „„_ 1 7r, 2 _ 1 7rl , then 

£ ± 2 • 2™- 2 7T e supp -0 only when p < -1. So in order that t/>(2™-" 1 - 2 £ ± 2 • 
2 n ~ 2 7r) 7^ 0, we must have p = n — m — I — 1 < — 1, i.e., m > n — I. We also 
have, < m < n — 3. Therefore, 1 < n — m — 2 < I — 2. Thus 



Ai(0 = }^I0(2^)| 2 
J">1 
n— 3 

2 {|^(2™- m - 2 (£ + 2 • 2 m 7r))| 2 + |0(2™- m - 2 (£ - 2 • 2 m 7r))| 2 } 
1=0 

l-i 
|^(2^)| 2 + J2 {1^(2^ + 2 • 2"- 2 ^)| 2 + |0(2 S £ - 2 • 2"- 2 ^)| 2 } 



m— 

i-2 

El 

i>i 



Now, 2 s £ + 2-2"" 2 7r e [c„,d„], 1 < s < 1-2. By (11), the second sum on the 
right hand side of the last equality above is equal to I — 2. Also as in the previous 
cases, the first sum can be shown to be equal to 1. Hence, D n (£) = 1—1. □ 

Remark 3. In [3] MSF wavelets were considered where they were called uni- 
modular wavelets. Let ip be an MSF wavelet and K = supp tp. 

Define K + — K n [0, 00) and K~ — K n (—00, 0]. One of the results proved 
in [3] is about wavelets ip such that K~ = —K + and K + consists of two disjoint 
intervals. They proved the following: 

(A) Let V>(£) = M0Xk(0; where K = K+ U K~,K~ = -K+ and K+ = 
[a n ,n] U [2 n ~ 1 7r,d n ] with |/z(£)| = 1. Then is a wavelet for L 2 (M). Moreover, 
each MSF wavelet for which K~ = —K + , and K + is a union of two disjoint 
intervals is of this form. 

If we define b = X[e„,ir] on [ e n,fr„], and extend it to S„ by using (i)-(iv) of 
Theorem 2, then we get all wavelets described in (A). In particular, for n — 3, 
the corresponding wavelet is the Journe wavelet. 

The paper [3] also contains the following result. 

(B) Let rjj(0 = A*(0x*(0 where 



K~ = 



-2il- 2 J^)^-(l- 2p+1 



2™ - 1 / V 2™ - 1 
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and 



K^ 



2(p+l) 2(2p + l) 

~^ ^^i ~^ ; — t 1 " 

2™ - 1 2" - 1 



U 



2"(2p+l) 2" +1 (p+l) 

IT IT 

2™ - 1 ' 2™ - 1 



for n > 3, 1 < p < 2"- 1 - 2 and |/^(£)| = 1. Then ip is a wavelet for L 2 (M). 
Moreover, each MSF wavelet for which K ~ is an interval and if + is the union 
of two disjoint intervals is of this form. 

If we define 6(£) — on the interval [e„,6„] and extend it to S n by using 
(i)-(iv) of Theorem 2, then we get 

k = X[-fc„,-a„]u[a„,e„]u[c„,d„] 

which corresponds to the case p — 2™~ 2 — 1 in (B). In particular, for n — 3, we 
get the Lemarie wavelet. 
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Abstract 

A generalisation of Ostrowski's inequality for functions defined on lin- 
ear spaces in terms of Gateaux derivatives is given. Applications for semi- 
inner products in normed linear spaces are also pointed out. 
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1 Introduction 

The following result is known in the literature as Ostrowski's inequality (sec e.g. 
[18]). 

Theorem 1 Let f : [a, b] — > R be a differentiable mapping on (a, b) with the 
property that \f (t)\ < M for all t £ (a, b) . Then 



1 '* 



/(*)-&— : / f^ dt 



< 



1 (s-*±*) 



4 (b - a) 



2" 

(b-a)M, (1) 



for all x G [a, b] . 

The constant \ is the best possible constant in the sense that it cannot be 
replaced by a smaller constant. 

This result has been generalised for absolutely continuous functions / : 
[a,b] -> R (see [19], [20] and [22]) as follows. 
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Theorem 2 Let f : [a, b] — ► R &e absolutely continuous on [a, 6] . TTien /or aH 
a; G [a, b] , we have 

1 



/(*) 



l _, c j - 

4 



fc— a 



b — a 

C'-'Oll/'lL, 



/ (<) d£ 



(2) 



< < 



( P +i)p 



p+i /, \p+i 

a: — a \ i / 6— r 



b—a 



b—a 



if f e ioo [a, 6] 



(&-a)MI/'ll,,tf /6L,M]; 



fc-o 



J 111 



where \\-\\ r (r£ [l,oo]) are i/ie usual Lebesgue norms on L r [a, b], i.e., 



and 



Ifllloo := ess SU P l5(*)l 
te[o,6] 



\g(t)\ r dt) , re [l,oo). 



The constants \, — — T and \ respectively are sharp in the sense mentioned in 



Theorem 1. 



( P +i)p 



The above inequalities can also be obtained from Fink's result in [23] on 
choosing n = 1 and performing some appropriate computations. 

Recently, the Ostrowski type inequalities (2) were extended to functions with 
values in Banach spaces (see [2]). 

Theorem 3 Let {X, ||-||) be a Banach space with the Radon-Nikodym property 
and f : [a, 6] — ► X an absolutely continuous function on [a, b] . Then we have 
the inequalities 



f{s)-^- a {B) J /(/),// 



a+b 



b — a 



(&-o)|||/ , ||| [0l6] , 00 ,*/ fei^M;*); 



< < 



(q + iy 



s — a \ 
b-a) 



b-^ q+1 



b — a 



(3) 



(b-a)-'\\\f'\\\ laMiP , 



if f eL p ([a,b];X);p>l, \ + \ = \, 



a + b 



11/ lll[a,6],l 
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for any s £ [a, b] , where 



\\\f\\\la, b] ,oo--=eSS Sup ||/'(i)|| 

te[a,b] 



and 



\f'W\la, b]j> --={ / \\f'(t)\\ P dt) , P >1. 



In this paper we establish similar results for functions / defined on segments 
in general linear spaces. We recall that, if a,b E X, a ^ b and X is a real linear 
space, then by the segment generated by a and b we understand the set 

[a, b] := {x e X\x = (1 - A) a + Xb, A e [0, 1]} . 

Natural applications for semi-inner products in normed linear spaces are also 
provided. 



2 Ostrowski Type Inequalities 

The following result of Ostrowski type for functions defined on linear spaces 
holds. 

Theorem 4 Let X be a linear space, a,b e X, a ^ b and f : [a, b] C X — > M be 
a function defined on the segment [a, b] and such that the Gateaux differential 
V/ [(1 — •) a + -b] (b — a) exists a.e. on [0, 1] . Then for any s <G [0, 1] we have 
the inequalities: 



f [(1 -s)a + sb] f[(l-t)a + tb] dt 



+ (-§)' 



||V/[(l--)o + -6](6-o)|| 00 , 

if V/ [(1 -■) o + -6] (6 -o)G Loo [0,1]: 



< <^ 



(<? + !)* 



S i +1 + (1 - s) q+1 ] ~ q || V/ [(1 - •) a + -b] (b - a) 



(4) 



if V/[(l -■)(* + ■&](&- a) GL P [0,1], p>l, i + | = l; 



||V/[(l--)a+-6](6-a)|| 1 



where \\-\\ (r G [l,oo]) are the usual Lebesgue norms on L r [a,b] , i.e., 

II V/ [(1 — ■) a + -6] (& — a) Hoo -=ess sup |V/[(1 - u)a + ub) {b- a)\ 

ue[o,i] 



and 

||V/[(l--)o + -6](6-o)|| p 



|V/[(1 -u)a + ub](b-a)\ p du) ,p>l 
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The constants j, — - — r and -k respectively are sharp in the sense that they 

(8+1) « 
cannot be replaced by smaller constants. 

Proof. If we consider the auxiliary function g : [0, 1] — ► R, 

g(t)~f[(l-t)a + tb] 

then we observe that for any s€ (0, 1) 

j< B ) =lim a(t)-a( S ) =lhn f[(i-t)a + tb}-f[(i- s )a + s b} 

t — *S f — S t — *s ~f~ — S 

,. f [(1 - s) a + sb + h (b - a)] - f [(1 -s)a + sb] 
= hm 

h-yQ h 

= V/ [(1 -s)a + sb] (b - a) 

showing that s is a point of differentiability for g iff the Gateaux differential of 
/ in the point (1 — s) a + sb along the direction b — a exists. 

Since we assumed that V/ [(1 — •) a + -b] (b — a) exists a.e. on [0, 1] , it fol- 
lows that g is absolutely continuous on [0, 1] and by Theorem 2 we may state 
that 



9(') 



9 (t) dt 



+ («- If 



if g' e ioo [0, 1] 



< < 



(g + iy 
1 

2 + l S 



s^ + (l- s y +1 q \\g'\\ p ,i{ 9 , Gi p [0 ) l] ) p>l 1 i + i = l; 



for any s € [0, 1] , and the inequalities (4) are proved. 

The sharpness of the constants follows by the scalar case. ■ 

The best inequality we can get from (4) is embodied in the following corollary. 

Corollary 1 With the assumptions of Theorem 4, we have the inequality: 



a + b 



1 



f[(l-t)a + tb] dt 



||V/[(l--)o + -6](6-o)||, 



< < 



if V/ [(1 -•) o + -6] (6 -o)G Loo [0,1]: 

; 1 i l|V/[(l--)o + -6](6-o)|| p! 
if V/[(l--)o+-6](6-o)GL p [0,l]: 



(5) 



v 2 II V/ [(1 -.)o + -6](6-o) \\ x . 
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The constants j, , ,,,,<,.> 

2(<,+l)« 



- — r and i are sharp in the sense mentioned above. 



Remark 1 If the function f : [a, b] C X — ► M. is convex on [a, b] then 
V/ [(1 — •) a + -b] (b — a) exists a.e. on [0, 1] and 

V/ [(1 -s)a + sb] (6 - a) = V±/ [(1 - s) a + sb] (b - a) 

for a.e. s G [0,1], where V±/ [(1 — s) a + sb] (b — a) are the Gateaux lateral 
derivatives which exist in every point s G (0, 1) . 

If we recall the Hermite-Hadamard integral inequality for convex functions 
defined on the segment [a, b] C X, 

then the above corollary applied for convex functions / : [a, b] C X —> R provides 
the following counterpart inequality for the first inequality in (6): 



< / /[(l -t)a + tb]dt- f 



a + b 



4 l|V±/ [(1 -•)<*+■&] (6 -a)IL, 

if Vi/^l-Oo+^Kft-oJeLoolCl]; 

-, <f ^ IIV+/ [(1 - •) a + -b] (b-a)\\ 

2(« + l)« 

if V±/[(l--)o + -6](6-o)eL p [0,l],p>l, ^ + ^ = 1; 



v 2 l|V±/ [(i-.)o+-6](6-o) Hi. 



(7) 



3 Applications for Semi-Inner Products 

Now, assume that (X, ||-||) is a normed linear space. The function fo(x) 
| || a; || , x G X is convex and thus the following limits exist 



(iv) (x, y) s := (V+/ (y)) (x) = ( lim + 



||y+ia:|| — ||y|| 



(v) (x,y) i :=(V-f (y))(x)= lim 



|y+sa:| — ||y| 



for any x,y G X. They are called the lower and upper semi-inner products 
associated to the norm ||-|| . 

For the sake of completeness, we list here some of the main properties of the 
mappings that will be used in the sequel assuming that p, q G {s, i} and p =/= q : 



(a) (x, x) = \\x\\ for all x G X; 
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(aa) (ax, f3y) = a/3 (x, y) of a, j3 > and x, y € X; 
(aaa) (x,y) < ||x|| ||y|| for all i,t/(;I; 

(av) (ax + j/, x) p = a (x, x) p + (y, x) p if x, y e X and ael; 

(v) (-x,y) p = - (x,y) q for all ijel; 

(va) (x + y, z) p < \\x\\ \\z\\ + (y, z) p for all x,y,z E X; 

(vaa) The mapping (•, •) is continuous and subadditive (superadditive) in the 
first variable for p = (or p = i)\ 

(vaaa) The normed linear space (X, ||-||) is smooth at the point x E X\ {0} if 
and only if (y, x ) s = (y, x ) i for all y E X; in general (y, x) i < (y, x) s for 
all x,y E X; 

(ax) If the norm ||-|| is induced by an inner product (•,•), then (y, x) i — (y, x) — 
(y, x) s for all x,y E X. 

Now, if we apply Theorem 4 for the function /o : X — > R, /o (x) = g ||x|| , 
then we may state the following proposition: 

Proposition 1 For any a,b E X and s E [0, 1] , one has the inequality: 



< 2x( 





ll(l- 


- s) a + 


sfe|| 2 - 


[;♦<- 


^ 


sup 
ue[o,i] 


(6 -a 


1 


>+l + (1 _ s )9+!" 


(n X1U 



||(1 -£)a + £6|| 2 di 



(6 - a, (1 - ?i)a + M&) s(i) 



(8) 



(&-o,(l-w)a + u6> sW 



d« 



(b - a, (1 - u) a + ub) s(i) 



du 



p> 1, i + i = 1; 

^ i p q ' 



(9) 
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and, in particular, 



< / \\(1 - t) a + tb\\ 2 dt - 
Jo . 



,°1 



a + 6 



< < 



sup 
«e[o,i] 



(g+l)« \J o 



(b - a, (1 - u) a + ub) s{i) 



(6-o,(l -u)a + u&) s(i) 



rfw 



p> 1, i + i = 1; 

^ ' p q ' 



(6 - a, (1 - it) a + u6) g (,-) dit. 

J u 

Remark 2 Since 

(6-a,(l -u)a + u6) s(i) < ||6-a|| ||(1 - u)a + ub\\ 



then 



sup 
«e [o,i] 



(10) 



(6 — a, (1 — u) a 4- w&) g /f) <||fr — o|| sup ||(1 — u)o + u6|| 

«e[o,i] 

= || b — a|| max{|ja|| , ||6||} , 



(6 — a, (1 — u) a + ub) us du < ||6 — a|| / ||(1 — u) a + ub\\ p du 



and 



L 1 

<6-a,(l-«)a + «6>. (4) |du<||6-a||/||(l-w)a + t*||d tt 

o o 

and &w (#) we may deduce the inequalities 

||(l-s)a + s6|| 2 - / ||(l-t)a + i&|| 2 dt 



(11) 



max{||a|| , ||6||} 



+ (s-hY 



< 2 ||6 — a||x < 



||(1 — u) a + m6|| p du 



|(1 — u) a + u&H du ■ 



(q+lY 



5-H- 1 + (1 - s) 9+1 



p> 1, 1 + i = 1; 



(12) 
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and, in particular, 



< / ||(1 - t) a + tb\\ 2 dt - 
Jo 

( 1 „, , 



a + b 

2 



:{N|,||6||}, 



< \\b-a\\ x < 



(q+1)* \J 



||(l-«)a + u6|| p du 



(13) 



p>l, - + - = 1; 



|(1 — u) a + u6|| du. 



Now, if we choose the function /o : X — > R, /o (x) = ||x|| , then we may state 
the following proposition as well. 

Proposition 2 For any a, b linearly independent vectors in X and s <G [0, 1] , 
one has the inequalities: 



||(l-s)a + s6|| - / ||(1 -t)a + tb\\dt 



1 / 1 \2 

4 + ( S -^) 



sup 
«e [o,i] 



(b- a, (1 - u) a + ub) s(j) 



< <^ 



(q+iy 



5« +1 + (1 - s) 



9+1 



(1 -u)a + ub\\ 

(b - a, (1 - u) a + ub) g ^ 



||(1 - u)a + ub\ 
(b-a, (1 -u)a + ub) s{i) 



du 



P> 1, i + i = 1; 



1 / 1 \2 

4 + ( S -5) 



]|(1 — w) a H- w&H 
||6-o||, 



d/< 



(14) 



< < 



(g + iy 



^ +1 + (1 - . 



\9+l 



16 -oil 



p>l, i + i = 1; 



2 + 1-31 



||6-o||, 
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and, in particular, 



< / ||(1 - t) a + tb\\ dt - 
Jo 



a + b 



< { 



- sup 
4 tie [o,i] 



1 

2(9+1) 



(b - a, (1 - u) a + ub) s{i) 



|(1 -u)a + ub\\ 



(b - a, (1 - u) a + w6) s (j) 
||(1 -u)a + ub\\ 



du 



(b-a, (1 - u) a + ub) s{i) 



||(1 -u)a + ub\\ 



p> 1, i + i = 1; 



dw. 



From the first inequality in (15), we get the simple and nice result : 



< / ||(1 -*)o + *6||d*- 



a + b 



< -116 -oil, 

- 4 II II' 



holding in any normed linear space X and for any a, b G X. 
The constant \ is sharp. 
Indeed, if we assume that (16) holds with a constant C > 0, i.e. 



< / ||(1 -t)a + tb\\dt- 
Jo 



a + b 



<C ||6 -all 



and choose X = R, ||-|| = |-| , a = — 1, 6= 1, then we get 



i ,i 1 

||(1 - t) a + tb\\ dt = \2t-l\dt=- 

(i Jo 2 

""'' =0, ||6-o|| = 2 



and (17) becomes 
showing that C > \ . 



<2C, 



(15) 



(16) 



(17) 



4 Applications for Functions of Several Variables 

Now, let fi C M. n be an open convex set in R". 

If F : £1 — > R is a diffcrentiable function on f2, then, obviously, for any c G £1 
we have 

V-dF(c) 



VF (c)(5) 



i— l 



■vu y g 
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where ^- are the partial derivatives of F with respect to the variable 
(i = l,...,n). 

Using Theorem 4, we may state the following inequality: 



F[(l-s)a + sb\- I F [(1 -t)a + tb] dt 
Jo 



(18) 



4 ' V S 2 



sup 
«e[o,i] 



^ dF((l-u)a + ub) 



dx,. 



< i 



(g + iy 



S 9+1 + (1 - s) 



9+1 



E HZ & - a i) 



dxi 



da 



1 

s 

2 



2^ HZ ' <A ~ a *) 



i=i 



<9x,; 



<iu. 



Now, observe that 



^ dF((l-u)a + u., 



dxi 



dF ((1 -u)a + ub) 



< < 



SF ((1 -u)a + ub) 
dx 

dF ((1 -u)a + ub) 



dx 



where |||-|||^ [i € [l,oo]) are the t— norms in W 1 , i.e. 



ll|6-o||li 
\\\b-a\\\ p , 

i 

lll&-o|||=o 



l«llli :=X> 
MIL :=(l> 



a > 1 



and 



whereas 



w L„ := max \Vj 



dF ((1 - u) a + ub) _ I dF ((1 - u)a + ub) dF ((1 - u) a + ub)' 

dx \ dxi dx n 
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Consequently, we get 



< <^ 



and 



sup 

ue[o,i] 



l&-a||li SU P 

u€[0,l] 



|6-a||L sup 
ue[o,i] 



& — «llloo SU P 

ue[o,i] 



^ dF((l-u)a + ub) 
dF((l-u)a + ul 



dx 

dF ((1 - u) a + ub) 
dx 

dF ((1 - u) a + ub) 



(19) 



l + T3 = l > a>l 



dx 



^ dF((l-u)a + ub) 



du 



(20) 



< < 



16 -oil 



lll&-a|||* 



16 -oil 



dF ((1 -u)a + ub) 

dx 

dF ((1 -u)a + ub) 



dx 

dF ((1 - u) a + ub) 

dx 



du 



du 



du 



for p > 1. 

In practical applications we usually assume that 



dF 



dxi 



:= sup 

o,oo -sen 



dF(u) 



dxi 



< oo. 



Then, by the first inequality in (18) we deduce: 



F[(l 



s) a + s 



b] 



< 



F[(l-t)a + tb] dt 


)] 


n 

E 


dF 
dx. 


1 



\h - Oj| 



for any s G [0, 1] . 



(21) 
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In particular, we have the following mid-point inequality 

\bi -Oi\. (22) 



f{^)- f Q F[il-t)-a + tl]dt\<\± 



dF 
dxi 



n.oc 
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Abstract 

We prove the following theorem, already proved by S. Steinberg [6] in 
one variable. Let d > 1 be an integer and O C C d be a connected open 
set. Suppose that the mapping 

T : n — ► B{X) 

z _- > T(z) 

is holomorphic in Q, and that T(z) is compact for all z £ fl. Then we 
have the following alternative: 

(i) either for all z € Q, e — T(z) is not invertiblc; 

(ii) either z i— » [e — T(z)]^ 1 is meromorphic in O. 

When dimX < oo, it is a consequence of Cramer's rule. When dimX — 
cxj, the proof is based on spectral theory and complex analysis. 
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1 Introduction 

Throughout this article, X denotes a complex Banach space and B(X) is the 
set of bounded linear operators on X. B(X) is a Banach algebra (with operator 
composition as multiplication) and we denote its unit by e (as Rudin, see [5, 
p.227-230] for details). 

We aim to prove the following 
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Theorem 1 Let d > 1 be an integer and 51 C C d be a connected open set. 
Suppose that the mapping 

T : 51 — ► B(X) 
z —► T(z) 

is holomorphic in 51, and that T(z) is compact for all z G 51. Then we have the 
following alternative: 

(i) either for all z & 51, e — T(z) is not invertible; 

(ii) either z i— ► [e — T(z)]~ 1 is meromorphic in 51. 

Steinberg [6] proved this theorem in the case d = 1 . 

Motivation. The typical situation where the Steinberg theorem should be 
used is the following. Suppose that a linear partial differential equation (PDE) 
problem can be stated as : 

Vz e 51, for b(z) e Y, find u(z) e X such that 
A(z)u(z) = b{z) 

where X and Y arc Banach spaces, z <— > A(z) is holomorphic in 57 and A{z) € 
B(X, Y) is an isomorphism minus a compact perturbation for all z € f2 : 

A(z)= e - 6(z) =0 o [e-0o 1 e(^)] (1) 

isomorphism compact compact 

Thus T(z) — Oq 1 Q(z) € B(X) is compact for all z e 51 and holomorphic in 51. 
For example, using this technique, the electric field scattered by a perfectly 
conducting object is shown to be meromorphic with respect to the frequency [4]. 
In order to deal with problems depending on more than one parameter, we have 
to generalize this Steinberg theorem to compact operators T(z) with z E C d , 
where d > 1 denotes the number of parameters of the problem. Note also that 
meromorphy is a necessary condition for convergence of Pade approximants (see 
[!])■ 

The outline is as follow. In section 2, we recall some elements of spectral 
theory. In section 3, we define holomorphic multivariate X- valued function. In 
section 4, we state precisely the notion of meromorphy for operator- valued func- 
tions in C d . In particular we emphasize that in finite dimension, the Steinberg 
theorem reduces to Cramer's rule. In section 5, we prove the Steinberg theorem 
for several variables. 



2 Some basic facts about operators 

Our proof of theorem 1 will make wide use of symbolic calculus and spectral 
properties of compact operators. We recall here, without proof, only what is 
needed to understand the proof of theorem 1. 
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Lemma 1 [5, p. 235] G = {T £ B(X) : T invertible} is an open subset of 
B(X) and the mapping 
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<t(T) 7^ and tr(T) is compact (closed and bounded) in C (see Rudin [5, p. 235]). 
Moreover <r(T) changes continuously with T, precisely : 

Lemma 2 [5, p. 239] Suppose T £ B(X), O is an open set in C, and a(T) C O. 
Then there exists e > suc/i i/iai ct(T+5') c O /or every S £ B(X) with \S\ < e. 

Symbolic calculus, (see Rudin [5, p. 240-248] for details) If P(z) = a + 
a\Z H — • 4- a n z n is a polynomial, we can define P(T) for every T £ B(X) by the 
formula 

P(T) = a e + oiT + • • • + a n T n £ B{X) (2) 

Moreover, we can do the same extension for any / £ 0(0), the algebra of 
complex holmorphic function on the open set OcC, using a Cauchy formula. 
Precisely, we define a mapping / £ 0(0) ^ f £ A = {T £ B(X) : a(T) C 0} 
by the Cauchy formula : 

/» = ^-/'/(A)(Ae-T)- 1 dA (3) 

2iir J r 

where T is any contour that surrounds o~(T) in O. If / is polynomial, the two 
formula (2) and (3) coincide. In particular, for the holomorphic functions in O 
defined respectively by lo : z n 1 and A : z h^ z, we have for all T £ Ao, 

To(T) = ^-J^\e-T)- 1 d\ = e (4) 

X(T) = — [ \(\e-T)- 1 d\ = T (5) 

This mapping / h^ / is an algebra isomorphism [5, p. 244], in particular fg = fg. 
Moreover, we have the important 

Lemma 3 (spectral mapping theorem) [5, p. 244] Suppose T £ Ao and 
f£0(0). Then 

a(f(T)) = f(a(T)) 

It is worth noticing that all the facts mentioned above remain true when 
B(X) is replaced by any Banach algebra (see [5, p.227-259]). However, the 
compact elements of B(X) have additional crucial properties. Recall that T £ 
B(X) is compact if the closure of T(U), U the open unit ball of X, is compact 
(in X). 
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Lemma 4 [5, p. 103] If T G B(X) is compact, o~(T) is at most countable, and 
has at most one limit point, namely, 0. 

Lemma 5 [5, p. 99] If T G B(X) is compact, and A =/= 0, then the nullspace 
M(\e — T) is finite dimensional. 

3 Holomorphy for operator-valued functions 

In the sequel, fi denotes an open subset of C d . 

Definition 1 Let W be a complex topological vector space, f : fl — ► W is said 
to be holomorphic in fi if it is holomorphic with respect to each complex variable 
zi, i = 1, . . . , d, that is the complex partial derivatives 

df , . .. f(zi,...,Zi-i,U,z i+ i,...,Zd) - f(zi,...,Zd) . 

dz~ {z) = t h Z- ^~J- ,» = l,...,d 

exist (in the topology of W) for every z — (zi, . . . , Zd) G H. 

Let 0(fl, W) the set of holomorphic W- valued function on Q,. In the sequel, 
we consider only the case where W is a complex Banach space (namely C, X, 
or B(X)). We simply write 0{Q) for 0{Q,C)- 

The following lemmas are very useful for proving holomorphy. 
Lemma 6 Suppose that the mapping 

T-.n — ► B{X) 

z i— ► T{z) 

is holomorphic and that T(z) is invertible for all z G fi. Then the mapping 
z >— » T(z)^ 1 is holomorphic in fl. 

Proof. T(z)^ 1 = I(T(z)). Use the chain rule and lemma 1. D 

Lemma 7 If the B{X) -valued mappings z i— ► T(z) and z i— ► R(z) are holomor- 
phic in f2, then the mapping z i— ► T{z)R{z) is holomorphic in Q. 

Proof. Use the chain rule, after noting that the mapping (S,T) i— ► ST 
from B(X) x B(X) to B(X) is bilinear continuous, hence diffcrentiable. Or just 
proceed by direct calculus, using definition 1 above. D 

Lemma 8 Let U be a measurable subset of R™. Suppose that the mapping 
T : (z, A) i— > T{z, A) from f2 x U to B(X) is such that : 

• z i— ► T(z, A) is holomorphic almost everywhere in U ; 

• t i— ► T(z, A) is measurable for every z G O; 
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For each z£il, there exists a neighborhood V of z and a positive function 
8 : U — ► M. such that, almost evrywhere in U , 

vtev, \T(t,\)\<6(\) 



Then the mapping z i— > G(z) = / T(z, X)d\ is holomorphic in Q. 

Ju 

Proof. Let z e O and V be a neighborhood of z as in the hypothesis of 
the lemma. By Lebesgue theorem, G(z) is continuous in V. Hence we can use 
the Morera caracterization of holomorphic function, for each of the d complex 
variables. The Fubini theorem gives the result. Indeed, let Q be a triangle in 
the Zi-plane of V . 

J G{z)d Zl =[[ T(z, \)d\d Zl = [ ( [ T(z, X)dzi) d\ = 
Jo Jo Ju Ju \Jo ) 



4 Meromorphy for operator valued functions 

We may define a mcromorphic function / : f2 — ► X as a function which can 
be written as / = 4, with h : ft — > X and <f> : Q — ► C holomorphic. However 
such a global definition is rather impracticable and we use a local definition of 
meromorphy, adapted from Range [3, p. 233]. 

Definition 2 / : Q — ► X is meromorphic in 51 if for every a G £1 i/iere are a 
connected neighborhood U G f2 onrf holomorphic function g G 0(U) with g ^ 0. 
smc/i i/iai <?/ can oe holomorphically continued into U . 

Remark 1 The main problem with a meromorphic function f in £1 is that f is 
not well defined as a function : f(z) is not defined when g(z) = 0. 

Remark 2 Connectedness of U is crucial in order to avoid that g = in an 
open subset of U . In other words, the germ of holomorphic function defined by 
g at each point t G U can not be null (see Range [3, p. 233] for a definition in 
term of germs). However, if U happened to be not connected, we can always 
choose a smaller neighborhood which is connected (a polydisc for example). 

Remark 3 In the particular case d = 1, every meromorphic function f : £1 — > 

X where f2 is an open set of C, can be written as f = 4 with h : fi ► X and 

4> : f2 — ► C holomorphic. 

For d > 2, this result is no more true in general, but it still holds if £1 is a 
Stein manifold [3, p. 24-9]. In particular, it holds if £1 is a polydisk B(z, R) = 
\t G C d ; \ti — Zi\ < Ri,Vi = 1, . . . , dj, and this is all we need to apply extrap- 
olation by Fade approximants [2]. 
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Obviously, / G 0(il, X) is meromorphic in il and the sum of two meromor- 
phic function is meromorphic. As a more interesting example, we show that 
theorem 1 when dimX < oo is nothing else than the Cramer's rule. 

Lemma 9 Let X be a complex finite dimensional vector space and fl be a con- 
nected open subset ofC. Suppose that 

T-.n — ► B{X) 

z i— ► T(z) 

is holomorphic. Then we have the following alternative : 

(i) either T(z) is nowhere invertible (■&■ det{T{z)) = in Q); 

(ii) either T(z)^ 1 is meromorphic in fl. 

Proof. Since X is finite dimensional, we choose a base for X and identify 
T(z) as a matrix with holomophic coefficients. Suppose that (i) is false. Let 
z G fi, therefore f2 is a connected neighborhood of z. The Cramer's rule 

m- * = ^^ntr- , f ^ m) * o 

shows that det{T{t))T{t)~ l can be continued into a holomorphic function in O, 
namely T(t) com . D 

5 Proof of theorem 1 

Let X be a Banach space and Q, c C d be an open set. Let 

T-.n — ► B{X) 

z ,— > T(z) 

be holomorphic in f2, and suppose that T(z) is compact for all z G il. 

If e — T(z) is nowhere invertible, there is nothing to prove. 

Suppose exists zgil for which e — T(z) is invertible. We have to show that 
[e — T{z)]" 1 is meromorphic in Ct. 

First consider the set 

U = {z ett; e- T(z) is invertible } ^ 

[/ is open (by lemma 1 and continuity of z i— > T(z)) and z h^ [e — T(z)]^ 1 is 
holomorphic in [/ (sec lemma 6), hence meromorphic. 
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Now, suppose z € ft \ U. We are going to prove that, for all z in a neigh- 
borhood of z , the operator e — T(z) can be splited into a regular part (with 
holomorphic inverse) and a singular part on a finite dimension space (hence with 
meromorphic inverse). 



Since e-t(zo) is not invertible, we have 1 € a(T(zo)), where cr(T(zo)) denote 
the spectrum of operator T(zq). Since T(zq) is compact, 1 is an isolated point 
of <j{T{zq)) (see lemma 4). Thus we can choose 0\ and 2 , two open subsets 
of C, such that 

Oifl0 2 =0 

Oina(T(* )) - {1} (6) 

a(T(z )) C OiU0 2 

Furthermore, we can choose two strictly smaller open subsets 0[ C 0\ and 
0' 2 C O2 with the same conditions (6). Now consider T = Ti UT2, a curve which 
surrounds cr(T(z )) in (Oi U 2 ) \ {0[ U 0' 2 ). See figure 1. 



Im(^) 



Re(A,) 




Figure 1: Spectrum of T(zq). 



Due to lemma 2 and the continuity ofz h^ T(z) in zq, there exists a connected 
(choose a polydisc, for example) neighborhood V d CI oi zq such that 



Vz e V, a(T(z)) C O'j U O2 

and thus T surrounds <r(T(z)) in (Oi U 2 ) \ (0[ U 0' 2 ). 
For all z G V, we consider 



(7) 



P(z)= (\e-T(z))- i d\=l 0l (T(z)) 
'ri 



(8) 
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where lo t (A) = 1 if A € 0\ and lo 1 (A) = if A <G O2 (note that loi is therefore 
holomorphic in 0\ U O2). By lemma 8, the mapping z 1— ► -P(z) is holomorphic 
in V. 

Since lo 1 (A)lo 1 (A) = lo^A), the spectral mapping theorem (lemma 3) 
implies that 

P(z)P(z) = P(z) (9) 

Let M z = K{P) (range of P) and M z c = 7V(P) (null space of P). We have 
M z C\M C Z = {0} and X = M z + M c z (easy, see [5, p. 126]). Moreover, we have 

P(z)T(z) = T(z)P(z) (10) 

Indeed, since T(z) — X(T(z)) and Alo^A) = lo 1 (A)A, the spectral mapping 
theorem gives again the result. Hence T(z)M z C M z and T(z)M z C M z . 

The key point is that P(z) is compact. Indeed, for all A G 0±, we have (Ae — 
T(z))- 1 (Ae-r(z)) = e. Since # O u (Xe-T(z))- 1 - \e = \{\e-T{z))- l T{z) 
is compact. Since A 1— > je is holomorphic in 0\, we have J r ^e<iA = 0. Hence 



P(z) 



(\e-T(z))^- l -e 



d\ (11) 



is compact as a limit of compact operators. Hence M z = J\f(e — P{z)) is a finite 
dimensional vector space (lemma 5). 

However, it is no use to reduce directly T(z) by the projection P{z) because 
M z depends on z. In this case it is usual to introduce 

U(z) = P(zo)P(z) + (e-P(zo))(e-P(z)) (12) 

= e-[(P(z)-P(zo))P(z)-P(zo)(P(z)-P(zo))] (13) 

Since z <-^ P(z) is continuous, there exists a neighborhood V of zq such that 
\(P(z) P(zo))P(z) - P(zo)(P(z) P(zo))\ < 1 

for all z G V. Thus U(z) is invertible. Since it is obviously holomorphic in V 
(use lemma 7), so is U(z)^ 1 (lemma 6). On the other hand, it is clear that, for 

all z eV, 

U(z)M z = M Z0 

U(z)M c z = M c Zo (14) 

Now consider the following operator (for all z G V) 

B(z)^U(z)[e-T(z)]U(z)- 1 (15) 

B(z) is splited into two parts. 

B(z) = B x {z)P{zo) + B 2 (z)(e - P(z Q )) (16) 

where : 
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• Bi(«) = B{z)\ Mzo - M ZQ — ♦ M Z0 

Since M Zo is finite dimensional, we can use the Cramer's rule as in lemma 
9. The only thing to prove is that det(B\(z)) can not be identically null 
in V. Suppose the contrary. B{z) is nowhere invertiblc in V. Thus 
1 G a(T(z)) for all z G V. Consequently z is an interior point of V = 
{z G ft : 1 G a(T(z))}. Thus V is a non empty open subset of fi. We 
are going to show that V is also closed. Let z' be an accumulation point 
of V . There exist z' n G V such that z' n — ► z' when n — > +oo. Since 
"P is closed, z' £ V. The same construction as above can be done for 
z' . In particular, there exists a connected neighborhood V' of z' where 
B'(t) = B[(t)P'{z') + B' 2 (i)(e - P'{z')). There exists n G N such that 
2^ G V. Since z' n E V, there exists an open neighborhood W of z^ such 
that W <Z P. Since -B 2 (£) is invertible (see below), B[(t) is not. Hence 
det(B[(t)) = in Wr\V, thus in V because V is connected. This proves 
z'eP. Since P is a non empty open and closed subset of the connected set 
fl, we have V = f2, which is in contradiction with the hypothesis (e — T(z)) 
invertible in B(X) for some z G ft. 
Thus det(Bi(z)) ^ in V and 2 ^ Bi(z)^ 1 is meromorphic in V. 

. B 2 (z)\ M . o :M c Zo ^M c Zo 

We have B 2 {z)x = U(z)[e -T(z)(e - P(z))]U{z)- l x, for all x G M C ZQ . 
Since we have T(z)(e — P{z)) = Alo 2 (T(z)), the spectral mapping the- 
orem gives a{T(z)(e - P(z))) = (Al 02 )(cr(T(z)), hence 1 g cr(T(z)(e - 
P(«))). Thus [e-T(z)(e-P(z))] is invertible, and so is B2O). Thus 
z 1— ► B 2 (z)~ 1 is holomorphic in V (lemma 6). 

Hence z h^ _B(z) _1 = _Bi(z) _1 P(zo) + -E>2(z)~ 1 (e — P(zo)) is meromorphic in V. 
Thus so is z 1— ► [e — T(z)] = U(z)B(z)~ 1 U(z)~ 1 and this ends the proof of 
theorem 1. 



6 Final comments 

In the above proof we have shown that the interior of the polar set V = {z G 
ft : 1 G a(T(z))} is empty. On the other hand, V is an analytic set and has 
other properties not discussed here (for example, see [3]). 

In the end let us mention some links between Steinberg alternative and Fred- 
holm's one. 

Fredholm alternative. Let T G B(X) be compact. Then the following alter- 
native holds : 

(i) either e — T is invertible (hence A/"(e — T) = {0} and equation x — Tx = f 
has a unique solution for all f ); 

(ii) either < dimj\f(e — T) < 00 (equation x — Tx = f has infinitly many 
solutions which form a vector space of dimension dimj\f(e — T) if f G 
lZ(e — T), and has no solution if not.). 
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Both are easy in finite dimension, and they use compacity for running in infi- 
nite dimension. Now consider a holomorphic compact operator-valued function 
z i— ► T(z) in a connected set £1 C C d . If e — T(z) is nowhere invertiblc in il (case 
(i) of Steinberg alternative) then case (ii) of Frcdholm alternative holds. Or else 
(e — T(z))~ 1 is meromorphic in fl (case (ii) of Steinberg alternative). Hence, for 
every zq € £1, either z i— > (e — T(z))~ 1 is holomorphic in a neighborhood of zq, 
where case (i) of Frcdholm alternative holds, either e — T(zq) is not invertible 
and case (ii) of Fredholm alternative holds. 
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ON A NEW FAMILY OF GAUSSIAN QUADRATURE 

FORMULAE OF BIRKHOFF TYPE WITH APPLICATIONS TO 

POLYNOMIAL INEQUALITIES 

ALLAL GUESSAB 



Abstract. A new family of Gauss type quadrature formulae, which contain 
terms involving certain derivatives at an interior point in the interval of integra- 
tion, is first presented and discussed. It is then shown how these quadratures 
can be constructed via eigenvalues and eigenvectors of a real symmetric tridi- 
agonal matrix. Numerical comparisons arc made with the classical Gaussian 
quadrature formulae. We also derive new explicit sharp weighted L 2 inequali- 
ties for coefficients of polynomials. They are all obtained as direct applications 
of the quadrature formulae created in this paper. 



1. Introduction and motivations 

The aim of this series of papers is to propose and analyze methods for developing 
efficient algorithms for the numerical computation of a new class of quadrature 
formulae. The discussion of [17] and [27] is partly concerned with quadratures that 
contain boundary terms involving some derivatives at both endpoints of the interval 
of integration. These quadrature formulae have the novel feature that they can best 
be appreciated in the case of integrals with severely ill-behaved integrands such as 
those with singularities at one or both endpoint. However, such techniques or 
popular methods are often computationally inefficient when integrating a function 
that has both "interior" regions with large functional variation and regions with 
small functional variation. This is entirely due to the excessive number of points 
required to achieve a reasonable amount of accuracy. In this part of the series, 
we shall present and construct a different class of quadratures which uses terms 
involving some well-determined derivatives of the integrand at certain sampling 
interior nodes fixed in advance and suitably chosen in the interval of integration. 
The remaining (free) nodes and weights are determined so as enable a minimum 
number of function evaluations. 

The choice of an appropriate quadrature for solving a numerical integration 
problem depends on the form and content of helpful singularity information about 
the integrand -like important information about the values of specific points or 
bounds of certain derivatives-. The most effective way we found for exploiting such 
additional informations is to incorporate in the quadrature formula some of these 
values, that can be chosen judiciously at locations where the integrand function 
is predominant. Thus, this approach may indeed have a strong influence on the 
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accurately and efficiently of such quadratures, because, as already expressed by 
Christoffcl (cf. [18, p. 86]), it often helps to find quadratures extremely well suited 
for numerical treatment in practice and it gives good ways of taking effective use 
of available information about certain smoothness properties of the integrand. 

The main purpose of this paper is centered around the development of a simple 
procedure for computing numerically the nodes xi^ m;a — zi,m,n{d<T;ji,...,j m ) and 
the weights \ Lm ^ n = Xi tm ,n(d<J;ji, -,3m) and Wj iiTn , n = uj jit m in (da; j lt ...,j m ) , such 
that the new quadrature formula, 

m n 

(1-1) Qa,m,n{f;jl,—,jm) = /]Wji,m,nr Ji ' (0) + /] M,m,nf{xi,m,n), 

»=1 1=1 

has maximum polynomial degree of exactness (MPDE) 

MPDE{Q a>m>n ) = 2n + 2ro - 1, 
i. e., 

Uf) = J\f(x)d<j(x) 

= Qcr,m,n{f',jl, ■■■,3m), V/ G 7 , 2n+2m-l • 

Here and subsequently ji, ...,j m , are given distinct real arranged in increasing order 
such that 2ji < In + 2™ — 1, i = 1, ...,m. For simplicity of presentation, we shall 
suppose that <i<7 is a even (nonnegative) measure with symmetric support with 
respect to the origin on the interval [— 1, 1] and all of whose moments exist. It is 
easy to modify the results to cover the more general case of an arbitrary nonnegative 
measure. Note that, these formulae share the property of the classical Gaussian 
formula that a formula with N — n + m evaluations is exact for all polynomials of 
degree < 2N — 1. Thus, they can be viewed as natural extension of the standard 
Gaussian quadrature. 

The problem stated, in the special case j\ = i, i = 1, ...,m, is classical, and the 
best of our knowledge there are only two methods for computing a such type of 
quadrature. One of them [41] is based on solving by Newton's method the nonlinear 
moment equations expressing exactness of the formula. Such a procedure is subject 
to considerable loss of accuracy and therefore required elevated precision. This is 
considered its main disadvantage. The other [24] is a very general algorithm that 
uses among other tools the full eigenvector matrix of a tridiagonal matrix derived 
by matrix decomposition methods. The weights are then obtained using general 
methods for constructing interpolatory quadrature formulae [15]. Except for their 
availability only for small n, both these methods lack the rapid convergence and 
elegance of the well-known algorithms of Golub and Welch [23] and Gautschi [20, 
p. 86] for ordinary Gaussian case (case where preassigned nodes do not exist, i. e., 
m — 0). Apart from these rather Hermitian cases, surprisingly no effective methods 
is currently available to compute Birkhoff-Hermite type case. 

For these reasons, we approach the problem of constructing the functional Qk.n 
differently Here, our principal results give a numerical algorithm which offers 
an efficient way of reducing the computation of the functional Qa,m.n (i- c., the 
computation of the nodes x; imi „ and weights A;. m ,„) to the well-studied problem of 
computing ordinary Gaussian quadrature formulae from recurrence coefficients, and 
can therefore be brought into the realm of stable modern methods of constructing 
orthogonal polynomials; see Gautschi [18]. The only overhead of this computation 
is the construction of new Jacobi matrix by changing some recurrence coefficients 
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which characterize the original measure da. This is a crucial aspect of the method, 
since both the programming task and computational effort are elevated consider- 
ably. 

The approach on this topic was stimulated by a request from a theoretical physi- 
cist to find a good quadrature which contains some derivatives at given interior 
nodes. However, the approach and techniques used here, which extend some of the 
results in [7], have been strongly motivated by the problem of Turan [43, Problem 
XXXIII] , who asked if there exists a quadrature formula of the form 

(1.2) j f(x)da(x)^J2(X t f(x l )+S l f k> (x l )), 

which is exact for all / £ V-m- In this form the problem has been solved, in the 
case k = 1, by Dimitrov [12] and extended, for arbitrary k, by the same author in 
[13] constructing a quadrature formula which uses exactly In function evaluations. 
It is naturally led to ask whether a quadrature formula of type (1.2) might have a 
minimal number of function evaluations. It should be observed that since da is even 
and the support of da is symmetric with respect to the origin then the free nodes 
of (1.1) are located symmetrically in the interval (— 1, 1) . Consequently, when n is 
odd then, due the symmetry, one of the nodes will coincide with the origin. Thus, 
if 77i = 1, the present work offers a simple solution for Turan problem. We also 
remark that not only the latter is more pleasing than those presented in [12, 13] 
since it requires considerably few evaluations (n + 1 instead of 2n) and it can be 
computed most conveniently as eigenvalues of a symmetric tridiagonal matrix, but 
additionally it achieves the highest possible polynomial exactness among all nested 
quadrature formulae which use the same number of function evaluations. 

Furthermore, this new class of quadrature formulae can find a wider range of 
applicability than merely to the particular type of problems considered here. In 
fact, in many physical applications it is not uncommon that no only the values 
of a function are accessible, but also some of its derivatives. In such a situation, 
it is meaningful to employ quadrature formulae involving these derivative values. 
An important method in which such type of quadratures can be applied in nat- 
ural way is in the numerical solution of partial differential equations by spectral 
approximation. See, in this connection, Bernadi and Maday [4] where generalized 
Lobatto-type formulae have been successfully used, and the author's joint work 
[17] where efficient spectral methods based on a new family of quadrature formulae 
where complex boundary conditions terms were included. 

The remaining six sections of this paper are organized as follows: In the next 
section, we prove existence and uniqueness of the quadratures we are seeking (1.1) 
and we also develop some of their properties. Then, in section 3, we give a detailed 
description of their characterization, and after that we present some properties 
of the Jacobi matrix associated with (1.1). The proofs are constructive, so it is 
straightforward to develop an algorithm which can compute (1.1). In section 4, 
we apply these properties together with the characterization results of section 3 
to construct a class of quadrature formula of type (1.1), we also solve a classical 
integration problem of Turan [43, Problem XXXIII]. Numerical examples demon- 
strating the efficiency and accuracy of our quadrature formulae, as well as some 
comparisons with the classical Gauss quadrature formula are presented in section 
5. Section 6 offers some sharp I? inequalities for coefficients of polynomials. These 
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inequalities arc all obtained as direct applications of the quadrature formulae cre- 
ated in the previous sections. Finally, we give some concluding remarks, followed 
by an appendix in which we include some of more technically involved proofs of 
Section 5 and 6. 

Remark 1. Throughout this paper, we will take the "standardized" integration 
interval [—1,1] as a matter of convenience, since any finite interval [a,b] can be 
mapped onto [— 1, 1] via an affine transformation and the resulting transformed in- 
tegration formula has the same MPDE. To simplify the presentation, we only seek 
quadrature formulae which contain terms involving some derivatives at 0. The 
reader should bear in mind that it is, in particular, for integrands, with a (inte- 
grable) singularity or with a high peak at or near as well as functions that 
are highly oscillatory in a neighborhood of 0, that this quadrature formula is de- 
signed specifically. When the integrands have several peaks or an arbitrary number 
of internal point singularity, we suggest to subdivide the interval [— 1, 1] into several 
subintervals, in such a way that the integrand has a peak or a singularity at the 
midpoint of each subinterval, and applying the quadrature formula (1.1) (using, if 
necessary, an appropriate linear transformation) to each subinterval separately. A 
consequence of this approach is that the most significant part of the analysis need 
only be carried out on the reference domain. 

2. Existence of the quadratures and some of their properties 

Our first result states that there exists one and only one quadrature formula of 
type (1.1). It is also shown that (1.1) possesses most of the desirable properties 
of the classical polynomial Gaussian quadrature formulae, namely, the interlacing 
property of the free nodes, their inclusion in the support of the measure, and even 
more importantly the positivity of all weights Xi. m ,n- These, as is known classically, 
are important properties that numerical quadrature formulae required to have. 

We first consider the problem of the existence and uniqueness of (1.1). 

Theorem 1. Given a nonnegative even measure da and nonnegative integers n,m, 
Jii J2> ■■■ijm with n > 1, then there is precisely one quadrature formula of type (1.1), 
which integrates exactly all polynomial ofp2n+2m-i- The nodes X\, m , n , . . . ,X n , m ,n 
are all in the open interval (— 1, 1), and their weights X\ ym ,n, ■ ■ ■ , X n . m _ n are positive. 

Proof. The proof is adapted version of the proof of Theorem 1 of the author's joint 
work [7]. 

I) Uniqueness. Suppose z = {zi, m , n , ..., z„,m,n} and y = {t/i, m ,„, ...,j/„, m ,„} 
were both sets of nodes such that the quadrature formula (1.1) based on each of 
them were exact for all polynomials from 7 7 2n+2m-i- Suppose that there exists 
io € {1, ...,n} such that z i iTO; „ ^ y i lTO , n . We now show that this impossible. In 
fact, by the well-known Atkinson-Sharma interpolation theorem [2], there exists a 
polynomial Pi a e T > 2n+2m-i such that 



i()\yi,m,n) 


= o, 


1 = 1,.. 


..,n, 




io \Zl,m,n) 


= o, 


1 = 1,.. 


..,n, 


I ¥" io, 


i {Zio,m,n) = 


= 1, 








f* } (0) = 

£* + D (0) s 


= o, 
= o, 


2=1,.. 
2=1,.. 


..,m, 
..,m, 
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(with the natural modification for the case: if z;. m ,„ = yi. m ,n then the conditions 
Pi ( z i., m ,n) = Pi (yi, m ,n) are replaced by P iQ {zi,m,n) = Pl (yi, m ,n))- The result now 
follows from the simple observation that if / = Pi and xi^ m _ n — yi. m ,n in (1-1) wc 
obtain 

Wo) - o, 

while if / = P l0 and £/ )TO) „ = Zj, m , n in (1.1) we obtain 

which is impossible, because \i (um ,n(z) > 0, so Z;,m,n = yi,m,n, I = l,—,n. On 
account of the uniqueness of the nodes x um _ ni I = 1, ..., n, it follows the equality of 
quadrature weights 

Aj, m ,„(2) = h, m ,n{y), I = 1, •••, n, 

and 

Wj i>m ,n(«) = ^ji,m,n{y), i=l,-.,m. 

This fact can be easily derived from interpolation. Thus from now on we may 
assume that the quadrature formula (1.1), if it exists, is unique. 

II) Existence result. In order to establish the existence of (1.1) let us consider 
the space S2n-i,2m defined by 

S 2n -i,2m = {fe V 2n+2m -i; P (2ji) (0) = P^ +1 >(0) = 0, i = l,...,™} . 

It follows from Atkinson-Sharma theorem that for any choice of ti, i — 1, . . . ,n, 
such that t\ < ... < t n and for all real data { y] > , there exists a unique 

I J i=l,j=0 

polynomial P G <!>2n-i,2m such that 

P(U) = y°, »=l,...,n, 
P'(ii) - yi, t = l,...,n. 

Thus S2n-i.2m is a space of dimension 2n, then by the classical Krcin theorem [29], 
there exists a unique quadrature formula of the form 

(2-1) / /(*) da( X ) = ]T tfm,nfKm,n) + <„(/), 

which integrates exactly all polynomials of <S2n-i,2m an d such that 



and 



A/L,„>0, i = l,...,n. 



We are now ready to prove the existence of (1.1). Let l2n+2m-i{f) be the interpo- 
lation polynomial of T > 2n+2m-i based on the information 

{/« ro ,J, ffe.J, i = l,...,n; /< 2 *>(0), .f (2 ^ +1) (0), i = l,...,m}. 

Then, it is well-known that 

In m 2jj + l 

/ 2 „ +2m -i (/)(*) = EE^(4J*toW + £ E / (fc) (0)Pfc,i, ro ,n(x), 

j=0 (=1 i=l fc=2ji 
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where hjj /m _ n , Pk,i, m ,n are the so-called fundamental polynomial functions. When 
f(x) is a polynomial of degree < 2n+2m—l, the interpolation polynomial coincides 
with f(x). Thus 

In m 2ji+l 

/w = EE/ (J) (^«)' i ^( i )+E E / (fc) (o)ft, 1 , m ,«w- 

j=0 1=1 i=\ k=2ji 

when / G T > 2n+2m-i- It follows immediately that an integration of Ii n +2m-\(f) 

leads to the following quadrature formula, 

(2.2) 

In m 2ji+l 



/I in m ^Jii-i 

/W<fc(*) = EEO W (4n)+E E ^ln/ (fe) (0),V/G7> ; 
_1 j=0 2=1 i=l fc=2ji 

where 

/•l 

x ?,Ln = h Jt i^ n (x)da(x), 



2n+2m-l, 



and 

/■l 
(2.3) w i,m,n = / Pfc.i.m.r^aOcKa;). 



3,m,ro 



Note that h\i mn G 1S271-1 2m, for all Z = l,...,n, and vanishes at the nodes of 

l,m,n 



(2.1), then applying (2.1) to /ii,2, m , n , we obtain A| ' = 0, / = l,...,n. We have 



a ^ so Kmn — ^Fmn' ' = !>••■: n , this fact follows immediately by applying the 
quadrature formulae (2.1) and (2.2) to /io,z,m,n, I = l,...,n. Hence, all weights of 
the free nodes in (2.2) are positive. 

Now, it only remains to prove that oj\^ n = 0, i = 1, ...,m. For this we use 
(2.3), the symmetry of da and the fact that P2j i +i,i,m,m i = l,...,JTi, are odd 
functions. This shows that (2.2) is a quadrature formula of the unique one (1.1) 
and thereby establishes the existence and uniqueness of (1.1). □ 

To be useful in practice, it is particularly desirable that an excellent quadrature 
formula should have the interlacing property The latter is, for example, shared by 
all the Gaussian quadrature. Our next theorem shows that for the formula (1.1) 
this holds true. 

Theorem 2. Given a nonnegative even measure da and nonnegative integers n,m, 
jii 32, ■■■ijm with n > 1. Assume that 

/\ m n 

/(») da(x) = J2 UJ i ,m,nf {2Ji) {0) + J2 Km,nf{xi, m ,n),yf & P2n+2m-l, 
- 1 i=l 1=1 

and 

/l m n+1 

f(x)da(x) = ^Wj i)m , n +l/( 2;7<) (0) + ^2 \l, m ,n+lf(xi, m ,n+l)y.f £ V2n+2m+l, 
"I ™ — 1 7 — 1 



1 = 1 



with —1 < xi^ mn < . . . < x nmn < 1 and —1 < xx mn j r x < . . . < £n+i,ro,n.+i < 1- 
Then the following interlacing property holds: 

1 ^ ^l,m, n-t-1 ^ *^l,m,n ^ ••• ^ X nrn _ n <. X n -\-i mn -\-i <* 1. 
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Proof. Arguing by contradiction, wc suppose there exists /i with 

*£jU,m,n ^ \<Kfj,,m : n-\-l 5 •E/i+l,m,n+l ) • 

Define the polynomial P^ G P2n+2m~\ by the interpolation conditions 



*fi\'El,m,n) U, t 1, 



. 77. . 



Pix{xi, m ,n+i) = 0, Z = 1, ....,n+ 1, Z ^ /z and Z ^ /i + 1, 

ff Ji) (0) = 0, i = l,...,m, 

pW«+i)( ) = 0, t = l,...,m. 

(As in the proof of Theorem 1, the modifications necessary are obvious if some 
nodes xi. m ^ n and Xi tm ,n+i coincide.) The existence of P^ follows from the theorem 
of Atkinson-Sharma once more. Taking into account that the total number of zeros 
of P M on [— 1, 1] does not exceed 2n+2m— 1 it easily follows that P/ J ,(x IJ ,+i. m . n +i) > 0. 
Therefore, 

(2.5) f P^x)da(x)=0 

by (2.4). But, by the second quadrature formula we have 
l 

-1 

We remark that (2.5) cannot hold because 

"fj, 7 m,n+l* \l\X ii,m,n+l ) > '^+l,m,n-)-l-'/^l,'£/A+l,m,n+l ) ^ U, 

which gives a contradiction. This shows the required interlacing property. □ 

3. Characterization Theorems 

The main result of this section shows that the free nodes of (1.1) are zeros 
of a certain orthogonal polynomial with respect to a linear functional / . We 
also establish that the weights relative to the interior nodes are proportional to 
the squares of the first components of the orthonormal eigenvectors, of the Jacobi 
matrix of order n associated with I . Then, we identify the orthogonal polynomial 
with respect to / as a quasi-orthogonal polynomial which can be represented 
as characteristic polynomial of a symmetric tridiagonal matrix, this is the key to 
our analysis. Thus, the interior nodes and weights of (1.1) can then be computed 
directly by standard software for Gaussian quadrature formulae. This will facilitate 
considerably the computation of the functional Q a ,m.n- 

In order to formulate the problem precisely, we first give some addtional notation 
and formal definitions as well as some known results. A further particular measure, 
which will play an essential role, is 

(3.1) da{x) =x 2l ™ +2 do-(x). 

We also introduce the following quadrature formula, 

A _ n 2jm+2 

(3-2) Qd<7,m,n\J'i3lT--iJm,) ~ L-, X l,m.n M,m,nJ [%l,m,n ) i 

1=1 
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in which x^ m . n , A/ im>n , / = 1, ...,n, are respectively the interior nodes and their 
respective weights in (1.1). Notice the new quadrature formula satisfies 

^ld£,m,n\J'i Jl i •■■! Jm) — Wcr,m,n\X J ', J\ , ..., J m ) ■ 



A 



Thus, Q dS m n is positive ( all its weights are positive) and it integrates exactly all 
polynomials from T-^n-r-i, where, for notational convenience, we denoted by r the 
following integer 

(3.3) r = 2j m - 2m + 2. 

Here, as well as in the remaining of this paper, we assume that such a integer has 
been given which satisfies 

(3.4) 2 < r < n. 

Let now the "nodes polynomial", which we write, for future convenience, q n . r , 

n 

(3.5) q n ,r{ X ) = H^ _ x l,m,n) 

1=1 

having as zeros the interior nodes xi_ m>n , I = 1, ...,n, of (1.1). 
Let / G Vin-i, dividing / by the polynomial q n<r gives 

(3.6) f(x) = q(x)q nt r(x) + r qnr {x), 

where both q and r Qn r are unique polynomials of degree less than or equal n — 1 . 
We follow the authors [40], [46] and define a linear functional associated with q n r 
by: 

1 j / ^ ^„. r (/)=/il^».rW^W- 

Note that the form I qn r depends on da, but for convenience we have suppressed 
this dependence to simplify the notation. This functional is positive on Vin-\ in 
the following sense: for every nonnegative polynomial / € Vi n -\i we have 

C(/)>0, 
and 

(3.8) /,„,(/) = 0, only if/ = 0. 

The proof of this result is essentially the same as the proof in [40, p. 397]. We 
include it here for the sake of completeness. It is easily shown, since the quadrature 
formula (3.2) is exact for all elements of V n -i (C T > 2n-r-i), that the functional 
I qnr satisfies 

hnM) = J-i r q n A x ) dd ( x )' 

A 

in view of r qnr ( x i,m,n) = f{xi,m,n), I = l,-,n. So, we have 

A 
*q n ,r \J ) = ^dS,m,n\J'j Jl) •■•) Jm)- 
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Thus, since Qdd. m ,n i s positive (all its coefficients are positive), we deduce that 
Iq n r (f) > for all nonncgativc polynomial / of V2n-i, and it remains to show 
that the only solution of I qn r (f) = is the zero solution. This is done by applying 
Lukacs representation theorem for nonncgativc polynomials (sec Szcgo [42, Theo- 
rem 1.2.2, p. 4]), which ensures that if / £ V<in-\ and nonnegative in [—1, 1], then 
/ can be represented in the form 

f(x) = (1 + x)C 2 {x) + (1 - x)D 2 {x) 

for some polynomials C and D G V n -\- As such and if it is non-trivial polynomial, 
then it has the property that it cannot vanish at all the nodes x;. mi „, I — 1, ...,n. 

The positivity of the linear functional / (3-7) on Vi n -\ gives rise to the 
concept of orthogonality with respect to the functional I qn r : Two polynomials 
u and v are orthogonal with respect to I qn r if I qn r (uv) = (see [10, Chapter 1, 
Section 2]). In particular, there exists unique monic polynomials 7r„(.) = 7r„(.,7 ) 
of degree n orthogonal with respect to the functional / to all polynomials of 
lower degree. These orthogonal polynomials can conveniently be represented by the 
tridiagonal Jacobi matrix 3 n (I ) of coefficients of the recurrence relation which 
they satisfy. 

We begin by showing an essential property of the interior nodes of (1.1), which 
we record for later use. 

Theorem 3. Suppose r, q n _ r and I Qn r are given respectively as in (3.3), (3.5) and 
(3.7). Suppose further that Xi )TOin , ..., z n ,m,n ore n-points on the interval (—1,1) , 
such that Xi >mtTl ^ Xj,m,n f or a ll i 7^ 3- Then the n nodes £i )TOirj ,, ..., x n , m ^ n are the 
free nodes of the quadrature formula (1.1) if and only if they are precisely roots of 
the monic orthogonal polynomial 7r n . 

Proof. We prove this result by using some elementary argument and showing that 
w n (.,I ) is in fact q n . r . To show this, observe that 

A 
(3.9) I Vnir (Qn,rP) = Qda,m,n(Qn,rP;jl,—,jm), for all p € "P„_i 

Thus, since 7r„(., I ) (assumed monic of degree n) is uniquely determined by (3.9), 
this identifies 7r„(., / ) as q n , r - The rest of proof proceeds exactly as usual. □ 

The following interesting corollary, which is obvious from Theorem 3, gives a 
simple and an important characterization of the free nodes in (1.1) that are recog- 
nized as eigenvalues of a symmetric tridiagonal matrix. Wc will use this fact later 
to efficiently compute them. 

Corollary 1. Suppose r, q nr and I qn r are given respectively as in (3.3), (3.5) and 
(3.7). Then the n nodes Xi )TOl „, ■■■,x ntm>n are free nodes of the quadrature formula 
(1.1) if and only if they are the eigenvalue of the Jacobi matrix 3 n (I ) which 
defines the orthogonal polynomial with respect to the functional I Qn r . 

Let us now suppose that the interior n-nodes of (1.1) are be found as eigenvalues 
of the symmetric tridiagonal matrix 3 n (I ) of Corollary 1. We now establish the 
following result, which simply says that the weights A;, mi „, I = 1, ..., n, of (1.1) are 
proportional to the squares of the first components of the normalized eigenvectors 
of 3 n (I ). This result will later be the key to efficient computation of A; iTOi „, 
1= l,.T,n, of (1.1). 
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Theorem 4. Suppose the symmetric tridiagonal matrix J n (I ) is given as in 
Corollary 1. Suppose the eigenvectors of 3 n (I ) are calculated such that 

Jn^JVj =I|,m,nVi, l = l,...,Tl, 

with VfVi = 1 and Vf = (vij, ..., V nt i). Then, the weights Xi, m ,n, I = 1, ...,»t, can 
be expressible in terms of the first components Vi t i o/V; by 

v 2 n 



X l m n •> —1 



.rti.n 



(3.10) h, m ,n = 2j~+2 / da(x), I = l,...,n, (7 is smc/i i/iai ir; lTO ,„ 7^ 0). 



Proof. The proof of our result is facilitated by showing that the quadrature formula 
(3.2) is precisely the Gaussian one for the functional / (3-7). Indeed, this 
fact can be proved, if one observes, in view of the exactness of (3.2) on V n -\ 
(c T J 2n-r-i), that for all polynomial of degree < n — 1 

V,(/) = I^mdaix), 

(O.llJ A 

— ^*dS,m,n\J i Jl> •"> 3m)- 

A 

Hence, the weights of Q d $ m n and those of the Gaussian quadrature formula (rel- 
ative to / ) are then identical. This result is a trivial consequence of the well 
known fact that the weights of a quadrature formula, which is exact on V n -\ and 
uses n distinct nodes, is uniquely determined. Consequently, the weights can be 
obtained uniquely from the linear system 

( 3 - 12 ) W/) - Q<»,m,„(/;Jl.-,Jm), all /€*>„_!. 

A number of methods are known for their construction. However, the equa- 
tions (3.12) do not lend to constructive purposes because of the well-known ill- 
conditioning associated with power moments, which is highly inconvenient for prac- 
tical use. In order to combat the high condition number, it is more effective to 
reduce this problem to a eigenvalue problem with respect to the functional / 

By the Gram-Schmidt orthogonalization process, the positivity of the linear func- 
tional / (3-7) on Vi n -\ implies the existence of a unique system of polynomials 

\ Qk(-) = qk(-, I g )j k — 0, ..., n > , qk being of exact degree k, that are orthonor- 

mal with respect to I . We observe that q n — q n>r / \ J_ 1 q„ r (x)da(x) and, since 

< r < n, an easy calculation shows that we also have go = Vv/-i d&(x). 

The remainder of the proof may be obtained by using a trivial modification of the 
argument for the simplest common case of Golub and Welsch [22, p. 223]. Indeed, 
it follows, from the fact that q^ are orthonormal polynomials for I qn r and from the 
well-known result of the zeros and the Christoffcl-Darbout formula satisfied by all 
orthogonal polynomials, that if I is such that xi. m _ n =/= then the weights Xi t m,n 
can be expressed in terms of qo, ..., q n -i, by 

(3-13) x Lm.n *l,m,n — „„_i : ^7 1 = 1, ...,Tl. 

Z^fc=0 [Qk{Xl, m ,n)\ 

Finally, to complete the proof of Theorem 4, we use the fact that X)fe=o v t i = ^' 
to prove that v\ l = Xi. m , n [qo( x i,m,n)} ■ This establishes the theorem. □ 
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We remark that, since Q d $ m „(.; ji, ..., j m ) is a Gaussian quadrature formula 
for the functional I qn , the positivity of the weights Xi t m,n, I — l,...,n, and the 
interlacing property of the nodes Xi tm ,n-, I = 1, ■■■, n , proved in Section 2, are thereby 
guaranteed at once, because these properties are shared by all Gaussian quadrature 
formulae. 

Remark 2. We do not know the functional I , but we do know a lot about it: 
We defer until Section 4 other aspects of its numerical implementation, such as the 
explicit or numerical construction of the coefficients of the three-term recurrence 
(or equivalently the Jacobi matrice) which define the polynomials orthogonal with 
respect to I . Note that, from an implementation standpoint, an important prac- 
tical aspect of our characterization of the weights is that the expression given in 
(3.10) depend only on J n (J ) and da . We remark also that the elegant relation- 
ships (3.10) can easily be extended to the case of an arbitrary positive quadrature 
formula. 

Following the same path as in [17] and [27], the next step in our quest for 
the quadrature formula (1.1) identifies the quasi-orthogonal polynomial (3.5). For 
this purpose, we first introduce the space /C n +2j rTl +i, which is useful subsequently, 

by 

(3.14) 

/C„ +2jm+ i = {P = q n , r R € Vn+2 jm +i; P (2ji) (0)= P^ +1 \0), i = l,...,m}. 

Again, using Atkinson- Sharma theorem, it is easy to verify that the dimension of 
/C„ + 2j m +i is r. Consequently, we may construct a basis {^o, ■••> ^r-i} for /C n +2j m +i 
of the form 

(3.15) *i(x) = x n+2m+l + Ri(x), i = 0,...,r-l, 

with Ri belonging to "P„+2m-i- In order to define a basis of 'P2n+2m-ij which 
will play an essential role for the proof of Theorem 5, we introduce the following 
polynomials 

(3.16) * fc (x) = x 2m+r+k q n:r {x), k = 0, ..., n-r-1. 

Observe that these polynomials satisfy the following orthogonality relations, 



/ $ fe (x) da{x) = 0, fc = 0,.. 



(3.17) / $ fe (x) da{x) = 0, k = 0,...,n-r- 1 

Note also that we have for, i = 1, ...,m, 

(3.18) $<: 2 ^(0)=$p +1) (0) = 0, k = 0,...,n-r-l. 

The relations (3.17) follow immediately from the quasi-orthogonality of q nr (see 
(3.20)) and the fact that 

J_ x &k(x) do~(x) — f.x k q n . r (x)da(x), k = 0, ...,n — r — 1. 
Thus, it can be easily proved that the following collection of polynomials 

(3.19) {x\ i = 0,...,n + 2m-l; tfj, j = 0, ...,r - 1; $ fe , k = 0, ...,n - r - 1} , 

forms a basis for 7 ? 2n+2m-i- I n the proof of the next theorem we shall have a use 
for this observation. 
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We are now in position to present our main result of this section, which is 
undoubtedly the key point to develop an algorithm for the computation of the 
functional Q a .m,n- 

Theorem 5. Given a nonnegative even measure da and nonnegative integers n,m, 
jii 32i---i3m with n > 1. Suppose r and the functions da, ^q, ..., \I/ r _i are given 
respectively as in (3.3), (3.1) and (3.19). Suppose further that X\, m ,ni •••> %n,m,n 
are n-points located symmetrically in the interval (— 1, 1) , such that a;j jTO) „ ^ Xj, m ,n 
for all i ^ j. Then the n nodes xi ym . n , ...,!„_„ are the free nodes of the quadrature 
formula (1.1) if and only if they are zeros of a quasi-orthogonal polynomial g„ r of 
degree n and order r with respect to da (= x 3m+ da), i. e. 

(3.20) / q n . r (x)p(x)da(x) = 0, for all p e V n - r -\, 

such that 

(3.21) j\^ t {x)da{x) = 0, i = 0,...,r-l. 

Proof. For the sake of notational simplicity the dependence of Qda,m,n{f\ ii; -Jm) 
on ji, ...,j m , will from on be suppressed, therefore we denote 

Wda,m,n\J: Jit •••; Jm) — Wda,m,n\J ) •> 

A 

where Qda,m.n is the quadrature formula defined in (3.2). We recall that Q dS mn 
integrates exactly all polynomials from T^n-r-i- 

Necessity. Assume that the nodes X\, m ,m ■■■,x n ^ m . n , are those of the quadrature 
formula (1.1). The first part of the result follows directly from the fact that the 
quadrature formula (1.1) is exact on Vin-r-i- I n fact, suppose that p has degree 
less that or equal n — r — 1. Since q n ^ r (x)p(x) is of degree at most In — r — 1 and 
the quadrature formula (3.2) is exact for elements of Vi n -r-\, we have 

l A 

/_! qn,r(x)p{x) da(x) = Qda,m,n(Qn,rP), 

= o, 

therefore, q n ^ r is a quasi-orthogonal polynomial of degree n and order r with respect 
to the measure da. 

For the second result, note first that, since r < n then for alH = 0, ..., r — 1, ^i is 
a polynomial of degree < 2n + 2m — 1. Thus, the exactness of Q a ,m.n on T J 2n+2m-i 
gives for all i — 0, ..., r — 1, 

j\^ l {x)da{x) = Q a , m ,n{^i), 

= 0. 

Which proves the necessity of the condition asserted in Theorem 5. 

Sufficiency. Assume that (3.21) holds and that the polynomial q nr is a quasi- 
orthogonal polynomials of degree n and order r having n distinct zeros xi_ m ^ n , ..., 
x n ,m,n on (-1, 1) . For a given function / on (-1, 1), we denote by 1 n +2 m -\{f\ ■), 
{hi t m,n,P2ji,i,m.n,P2ji+i.i,m.n} respectively the interpolation polynomial and the set 
of nodal basis functions of V n +2m-i with respect to the data 

{f(xi, m ,n), 1=1, ...,n; /«*>(<)), / (2ji+1) (0), i = l,...,m}. 
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The existence of l n+2m -i(f] •) and {hi imtn ,p2j i ,i,m,n,P2j i +i,i,m,n} follows at once 
from Atkinson-Sharma theorem. Then, we have 

In+k-l(f)(x) = ^2f(Xl,m,n)hl, m , n (x) +^2 E ^ ( Q )Pk,i,m,n(x). 
1 = 1 i=l k=2ji 

Since T n+ 2m-i(f',x) = f{x) for all / G V n +2m-i, wc obtain the following quadra- 
ture formula 
(3.22) 

/in m 2ji+l 

f( X ) da( X ) = Yl X l,m,nf(xi, m ,n) + E E "S.n/^C )' V / G ^n+2 ro -l, 
1 ;=i i=i fe=2ji 

where 

k,m,n= hi im>n (x)da(x), 

and 

/■l 

(3.23) Vi k m,n= Pka,m,n{x)d<j{x). 



To complete the proof of the theorem it remains to show that (3.22) integrates 
exactly, all polynomials of degree less than or equal to 2n + 2m — 1. This is eas- 
ily accomplished by using (3.19) that every polynomial P from T , 2n+2m-i can be 
expressed uniquely as 

n—i — 1 r— 1 

(3.24) P(x)= E a k ^k(x)+J2 b o^j( x ) + R ( x )' 

fe=0 j=o 

where R is a polynomial of degree less than n + 1m — 1 . The integration of (7) 
leads to, 

71 — T — 1 7" — 1 

j\p(x)da(x) = £ a fc £ 1 fc fc (aOd«0+£& J -/_i* J -(aO*KaO 

(0.25J fc = o .7=0 

+ X^ L 1 i?(a;) dcr(x). 

As for all k = 0, ...,n — r + 1, and j = 0, ...,r — 1, we have from (3.20) and (3.21) 
i ,i 

$k(x) da(x) = and / $j{x) da(x) = 0. 
• l J-i 

Therefore, by (3.22), we have 

(3.26) / P{x) da(x) = / R(x) dcr(x). 

Since R is a polynomials of degree at most n + 2m — 1, equations (3.22) and (3.22) 
imply that 

(3.27) J\p(x)da(x) = r=iA^ ro ,^(^, ro ,„)+^ 1 2 fe i+>| fe i n i?^(0). 
However, for I = 1, ...,n, and i = 1, ...,m, we have 

* J -(xi, m ,n) = *p } (0) = *(^ +1 )(0) = 0, j = 0,...,r-l, 
$ fe (z/, m ,n) = $r' l) (0) = *[ 2jl+1) (0) = 0, fc = 0, ...,n- r- 1, 
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which gives for all I = 1, ...,n, and i = l,...,m, 

i\Xl,m,n) ^\Xl,m,n) 5 

P( 2 ^)(0) = R^(0), 
p(%+i)( ) = 7j(2ji+i)(0). 

This implies that the quadrature formula (3.22) is exact for P, since 

1 n m ^ii + l ... 

(3.28) /VW^) - EW^J + E E41/ w (»)- 

1=1 i=l k=2ji 

Furthermore, since da is symmetric, and the support of da is symmetric with respect 
the origin, it follows easily from uniqueness that P2ji+i.i.m,m i = 1; ■■■,m, are odd 
polynomials. So that to\^ n = 0, i = l,...,m, hold trivially. This shows that 

(3.28) is a quadrature formula of the form (1.1). By Theorem 1, the quadrature 
formula (1.1) is unique, so the nodes xi, m _ n , I = 1, ...,n, are indeed those of (1.1). 
This completes the proof of Theorem 5. □ 

We recall that, since da is assumed an even function and its support is symmetric 
to the origin on [— 1, 1] , it follows from uniqueness that the nodes polynomial q n ^ r 
satisfy the following property: 

(3.29) q n<T {-x) = {-l) n q n , r (x). 

Thus, q n . r is an odd or even function depending on whether n is even or odd. 
In the discussion which follows, we suppose n = 2k + 1 and we denote by 7?j the 
monic orthogonal polynomial of exact degree i, that are orthogonal with respect 
to da. Since q nr is a linear combination of 7ro,^i,-"!^n) t ne orthogonality relations 
of the polynomials {7?;} and the symmetry property (3.29) yield that q n/r can be 
expressed uniquely as 

k 

qn.r (x) = 2jp»7T2i+l,(p2fe+l = !)• 

i=0 

So that, if the number of nodes n is odd then is a zero of the latter. Thus, 
a separate treatment is required for the case n odd, because in this case it is 
preferable to use the following characterization of the polynomial q n ^ r (x)/x, which 
can be useful in practice. 

Theorem 6. Given a nonnegative even measure da and nonnegative integers n, m, 
iii ji-f-tjm with n = 2k + 1 (k > 1). Suppose r, q n , r and the measure da are 
given respectively as in (3.3), (3.5) and (3.1). Then the polynomial q n>r (x)/x is a 
quasi-orthogonal polynomial of degree n — 1 and order r — 2 with respect to da, i.e. 

(3.30) / (q n _ r (x)/x)p(x)da(x) — 0, for all p &V n - r , 
or equivalently 

(r-2)/2 

(3.31) q n ,r(x)/x= ^ P2k-2i^2k-2i, (p2fe = l)- 

i=0 

Proof The argument is essentially the same that in Theorem 5, so we omit details. 

□ 
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Utilizing Theorem 5, Theorem 6 may be restated in term of the quasi-orthogonal 
polynomial q nr (x)/x. 

The computation of the functional Q a .m,n may be considered as solved, once 
the Jacobi matrix 3 n (I ) has been obtained. The basic idea of computing the 
coefficients in 3 n (I ) is as follows: 

The first step in our quest for 3 n (I ) computes the coefficients of the quasi- 
orthogonal polynomial q n _ r that represent it in the basis: 

{Tr k (.)=n k (.,dS), k = 0,...,n}. 

Thus, we first express the nodes polynomial q n ^ r in terms of monic orthogonal 
polynomials {7?fc} fc=0 ; therefore q nr may be written in the following form 

(3.32) q ntT . = 7T„ + Pl7T„_l + . . . + p r 7T„_ r , 

for some choice of real constants pi,...,p r , since, by virtue of Theorem 5 q n ^ r 
is orthogonal to V n - r -i with respect to the measure da. The coefficients {pkYk=i 
can then computed as follows: We substitute (3.32) into the equations (3.21) and a 
simple calculation shows that this gives rise to a (nonlinear) system of r equations 
with the unknown coefficients pk, k = 1, ..., r, ( note that the polynomials ^ are of 
the form \I/j = q n . r Ri for some Ri £ V2j m +i-) 

Thus finding these coefficients in q nr amounts to solving a r x r system of 
nonlinear equations. Our strategy is then to choose the solution in a way which 
will assure that the corresponding polynomial (3.32) has all its zeros on (— 1,1) . 
To clarify this fact, we will discuss this more fully in section (4) for the particular 
case r = 2. 

Once the underlying polynomial q n , r is available, we then appeal to Corollary 
1 which tells us that the latter must have a symmetric tridiagonal matrix rep- 
resentation. In theory there are a number of ways to calculate zeros of a such 
polynomial. The most effective one is to compute them as eigenvalues of a sym- 
metric tridiagonal matrix (just like the classical orthogonal polynomials), see, e. g., 
Goedecker [21]. This leads to great reductions in cost of quadratures. To use such 
a method for finding zeros of q nr , it is necessary to evaluate the elements of the 
Jacobi matrix 3 n (I ), for which q 7ltT is its characteristic polynomial. If we knew 
the quasi-orthogonal polynomial in the form (3.32), we could in principle find its 
Jacobi matrix for small r, see [17, 46], but, because of nonlinearity, the complexity 
increases rapidly with r. We will sec in the section that follows how to use this 
technique in the case r — 2. 

Finally, in the light of the discussion following the statement of Corollary 1, 
and in view of Theorem 4, the weights in (1.1) can be computed very easily and 
efficiently once the interior nodes have been determined. 

At this point in order to apply the results of this section, the matrix 3 n (I ) 
must be investigated. This serves as a motivation for the work outlined in the next 
section, in which we will say even more on the Jacobi matrix 3 n (I ). 

4. Computation of the Jacobi matrix 

The eigenvalue analysis described in Section 3 suggests to compute the roots of 
the n— th orthogonal polynomial with respect to / .As shown in Corollary 1, it 
can be computed via the Jacobi matrix 3 n {I ), which we denote by 
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(4.1) 



J«(J ) 



«() 


y/h 










Jbl 


a i 















V&n-2 










\/K-2 




«n-2 
\An-l 


y/bn-i 



This matrix plays a very important role in what follows. In order to clarify the 
connection between the original Jacobi matrix 3 n (da) and 3 n {I ), we first collect 
some auxiliary results for orthogonal polynomials {vr;} . Thus, the remainder of this 
section is mainly devoted to the details of computing the coefficients of 3 n (I ) 
whose eigenvalues are the quadrature nodes. In so doing, we find a remarkable 
structure of 3 n (I q7i )• 

The orthogonal polynomials {7?;} with respect to dd (3.1) are important tools 
for us. We suppose them monic, i. e., they arc normalized so that the leading 
coefficient of each 7?/ is 1. Since dd is an even measure, then these orthogonal 
polynomials satisfy a three-term recurrence relation of the form 

7r_i(a;) = 0, 9 (x) = 1, 

7T( + l(x) = XTTl(x) -f3lTTl-l(x), 1 = 0,1,2,..., 



(4.2) 



where the coefficients 
jes: 



< j3i , I = 1 , . . . > are given by the well-known formulae of Stielt- 



(4.3) 0o = o (da) = las (ttq) , A = &{&) 



h* (7rf_i) ' 



1 = 1, 



with Ids being the functional 



ha (u) = / u(x)da(x). 



Note that 0i > . These coefficients (assumed known or computable) define an 
unreduced symmetric tridiagonal matrix 3 n (da), the so-called Jacobi matrix, 



(4.4) 



3 n (da) 



0n-2 





0n-2 








0n- 



0n-l 




With the vector 



V n (x) = (7r (a;),7Ti(x),...,7r„(a;)) T 
we can conveniently rewrite the three-term recurrence relation (4.2) in matrix no- 
tation 

xv n _i(x) = 3 n (da)v n _ 1 (x) + n TT n (x)e n 
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where e„ = (0, 0, ..., 1) denotes the n-th unit vector. 

In order to construct (1.1), we need to know the coefficients of its Jacobi matrix 
J ra ( I ) . To do this, we begin by proving an essential result which reduces consid- 
erably the number of unknown coefficients in J n (I ). In the following the symbol 
[x] denotes as usual the greatest integer in the real number x. 

Lemma 1. Suppose da, J„(7 ) and 3 n (da) are given respectively as in (3.1), 
(4-1) and (4-4)- Then, for any integer r, such that < r < n, the following hold: 

(a) the first n — [r/2] — 1 diagonal elements of 3 n {I ) are equal to 0, 
and 

(b) the first n — [(r + l)/2] subdiagonal elements of 3 n {I ) equal those of 
Jn(dd). 

Proof. We first note that the recursion coefficients a^ , bk in turn are expressible in 
terms of the monic orthogonal polynomials qk relative to the functional I qn r as 

jfe, (sgfc) , n , 

flfe = ~r j-2s-,k = 0,...,n- 1, 

(4 - 5) \ (it) 

b o = Iq n , r (go) , b k = qn ' T , 2 fc v fc = l,...,n- 1. 

On the other hand, if / is an arbitrary polynomial in T-^n-r-i then it can be 
written uniquely as / = qq n . r + h, where q G V n - r -\ and h G V n - r -i- This implies 
whenever / is a polynomial of degree less than or equal to In — r + 1 , 

A 

Iq n , r (f) = QdS.m.nQ 1 ; jl, •••, jm), 

A 

where the trivial relations f(x^ m . n ) — h(%i,m,n), I = 1, •■•, n, have been used in the 

A 

last equality. Thus, since Q d $ mn is exact for all polynomials of degree < In— r— 1, 
we conclude that 

(4-6) I qntr (f) = Ids(f), V/ e V^n-r-i- 

Moreover, for every k = 0, ...,n— \{r + l)/2] , and / € Pfe-i, we have g^/ G ^n-r-i 
and therefore, by (4.6), 

IqnAlkf) = I ds(qkf), 
0. 

This means that ^ is also orthogonal to Vk-i with respect to the linear functional 
/rfj. Since the two polynomials qk and 7?/c have the same leading coefficient, it 
follows from uniqueness that they must be identical. This identifies qk as 7Tfc, for 
k = 0, ..., n— \(r + l)/2] . The proof of (a) and (b) is completed by using the formulae 
(4.3) and (4.5). □ 

This Lemma shows that the first desired In — r — 1 recursion coefficients in the 
sequence {ao, b, m,bi r ..} equal the corresponding coefficients belonging the measure 

da in the known sequence < 0, /3o,0, j3i, ... \ ■ So, an easy calculation shows that 

J„(J ) contains only r parameters. The basic idea of computing them is as 
follows. Suppose that the nodes polynomial q nr has been determined as a quasi- 
orthogonal polynomial degree n and order r, one can therefore obtain the remaining 
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r unknown coefficients of 3 n (I ) by the condition that q n ^ r is its characteristic 
polynomial. However, in order to determine this polynomial, we need to solve a 
nonlinear system of r equations with the r unknown coefficients pk- Of course, this is 
in general not so easy for large r to evaluate such coefficients by explicit expressions. 
Nevertheless, as we shall sec in the next section, we are able to determine them 
analytically at least for small values of r that are of interest in applications. 
We are thereby interested in the following problem: 

Problem 1. Given a quasi-orthogonal polynomial in the form q n ^ r — 7r„ + pi7r n -i + 
. . . + p r Tr n - r . As such how it can be represented as characteristic polynomial of a 
symmetric tridiagonal matrix. 

Explicit results in this direction, involving an arbitrary measure, indeed have 
already been obtained in [17] for the case r = 1, 2, 3 or 4. We reformulate it, in our 
situation, as follows: 

Theorem 7. Let q n ^ be a quasi- orthogonal polynomial of the form 

Qn,4 = 7?n + Pl^n-1 + • • • + P4^n-i, 

where Then q n ^ has a symmetric tridiagonal matrix representation of the form 

q n ,i{ x ) = det(xl„ - J*) 
with 



j: = 












if and only if 

(4.7) 

and 








0n-2 - V 2 









Pn-2 - V 2 

-u 2 



ftn 



V\ 








Pn-1 ~ Vi 
-Ui 



Pi 



< 
< 



/3n-3f3n-2, 
Pn-1, 



Ml 
W, 2 

V2 



92- 

P\- 
PZ 

Pn-2 
Pi 

Pn-3 



v 2 -u 2 ui, 
u 2 

-V2P1 

v 2 



The main results of the previous Theorem were given for the particular case 
r < 4. Of course, it could have been carried over to r > 5, but the problem is 
mainly computational. A more general form of this result has already appeared in 
[46, Theorem 4.1]. 

Note that, the quasi-orthogonal polynomial considered here has a symmetric 
tridiagonal matrix representation, then the inequalities (4.7) are automatically sat- 
isfied. 
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As remarked previously for quadrature formula, it is important to have all its 
nodes in the support of the measure. We shall see in the next section when trying 
to construct the quasi-orthogonal polynomials q n _ r that there are, in general, only 
two polynomials which satisfy the conditions (3.21). 

Then, we arc faced with the following problem: 

Problem 2. How should one select the polynomial which has all its zeros in (— 1, 1)? 

In general, it is easy for small r to diagnose whether a quasi-orthogonal polyno- 
mial has all its zeros located in (— 1, 1) once its coefficients {f>k} are available: we 
refer the interested reader to the discussion of that point in [14] or [45] . For the 
case r = 2, the following Theorem gives an easy way which ensures that a quasi- 
orthogonal polynomials has no exterior roots. This result will play particularly a 
key role in our discussion later in obtaining an explicit expression for the nodes 
polynomial of (1.1) for the particular case r — 2. 

Theorem 8. Let q n2 be a quasi-orthogonal polynomial of degree n, and order 2, 
of the form 

A 

Suppose that p 2 < /3„_i, then q ny2 has all its zeros in (— 1, 1) if and only if 

7T„(-1) 7T n (l) 



Pi < min 



7T„_2(-1)' 7? n -2(l) 



Proof. By Theorem 7, the n zeros X\ < x 2 < ... < x n of q n ^ 2 are distinct eigenvalues 
of the symmetric tridiagonal matrix 



J«(<?n,2) 































y/lL-2 









\Jpn-2 







- Pi 





V Pn-1 - Pi 





Let the n — 1 zeros of TT n -i be ordered according to ti < ti < ... < t n -\. 
Obviously, these nodes are eigenvalues of the (n— 1) x (n— 1) leading submatrix of 
the matrix 3 n (<ln,i), then Cauchy's interlace theorem (see, e. g., [30, p. 186]) can 
be applied to give the following interlacing property of the nodes: 

Xi < h < x 2 < ... < t n _i < x n , 

which implies that q n ^ 2 has n — 2 zeros on (— 1, 1) . Note also that, q n>2 can have at 
most two exterior nodes, one at each end point, because of the interlacing property. 
Since the leading coefficient of q n _ 2 is 1, then q n .i(X) is positive if only if it has 
no zeros greater that or equal to 1. According that 7r„(l) and 7r„_ 2 (l) are both 

positive, and hence x n G ( — 1,1) if only if p 2 < — " — ; the derivation of X\ is 

7Tn-2(l) 

similar. D 
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5. Computation of Turan quadratures 

In this section, we see how the above theory is put into practice to construct a 
new class of quadrature formulae of type (1.1). As with the earlier results quoted 
above, the amount of computation in our approach increases rapidly with r. To 
shorten our presentation and to better illustrate our approach we only concentrate 
on the devivation of our quadrature in the case r = 2. More precisely, we shall 
examine the problem of computing the following quadrature formula, 

/l n 

f(x)da(x) « «2, n /"(0) + ^ \nf(Xl, n ), 
1 1=1 

which is exact for all / <G T^n+i- 

Note that, as we have previously pointed out at the introduction, the present 
quadrature offers another solution for the problem of Turan ( [43, Problem XXXIII]) . 

Since the construction of (5.1) is rather technical, we sketch the main ideas. To 
begin, observe that Theorem 1 guarantees the existence and uniqueness of (5.1), 
with respect to da on the interval [—1,1]. Moreover, in this particular case (using 
the values consistent with our notations of Section 3) the relevant parameters are 
m = 1, ji = 1 and r = 2. 

Following the analysis of the previous sections, the first step in our method is 
to establish some important information about the entries of 3 n {I ) and the 
coefficients of the quasi-orthogonal polynomial q n ^ associated to (5.1). The lemma 
1 in Section 4 furthers this aim, since, in the present case, it reduces the problem of 
finding the Jacobi matrix 3 n (I ) (4.1) to compute the remaining two its unknown 
coefficients a„_i and 6„_i. Indeed, here the known elements of Jacobi matrix are 

a, = 0, i = 0, ..., n — 3 
h = Pi, i = 1, ...,n- 2, 

where /3j, being the recurrence coefficients associated with the orthogonal polyno- 
mials 

{tt™ = n n (.;da(x) (= xi dcr(x))} n=012 ^ 

This observation drastically simplifies the calculation issues associated with (5.1) 
which we will carry out in what follows. 

We are now faced with the problem of computing the remaining unknown coef- 
ficients a„_i, a„_ 2 & n( l fr«-i m the Jacobi matrix 3 n (I ). Thanks to Theorem 7, 
this task can be achieved by computing the quasi-orthogonal polynomial induced 
by (5.1). Note that, in view of Theorem 5, the latter is a quasi-orthogonal polyno- 
mial of degree n and order 2. Thus, we see that (5.1) exists if its nodes are roots of 
a polynomial, which can be expressed in term of {7fn-fc}fc=o as 

(5.2) q n _2 = 7T„ + C„7T„_l + d„Tf„-2- 

So we have two unknown coefficients in q n ,2- To find them, recall that q n _2 is an 
odd or even function depending on whether n is even or odd, then it is clear that 
there is only one free parameter in q n ,2, since, due to symmetry, one obtains c n = 0. 
Thus the polynomial to be studied now takes the form 

(5-3) q n ,2 = 7T n + d„Tf„-2- 
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Furthermore, let us note that, applying Theorem 7 allows us to infer that the 
unknown coefficient a„_ 2 & n d a n -i a re equal to 0. Thus, the problem of finding 
(5.1) is essentially solved if cither the parameters d n and 6 n _i in J n (I ) and q Ut 2 
respectively can be evaluated analytically or at most numerically. We remark that 
once d n is know 6„_i can be determined uniquely by the representation Theorem 
7. 

Now, we proceed by first discussing how one can determine the remaining free 
parameters d n in (5.3), and then outline the basic steps of a numerical algorithm 
for constructing (5.1). The parity of n plays a basic role in the computation of the 
latter, for this reason, we have to distinguish between two cases. First we begin 
with: 

5.1. Case I: n is odd. This is the simplest case. The key observation is that in 
view of Theorem 6 the quasi-orthogonal polynomial q n ,2{x)/x coincides with the 
monic orthogonal polynomial 7r„_i. Thus, the nodes of (5.1) are precisely zeros of 
the polynomial 

(5.4) q n ,2(x) = xn n -i(x). 

Since X7r n _i can be written (uniquely) as a linear combination of ttq^i, ...,7r n , the 
orthogonality relations of {717} yield 

<lnfi{x) = 7„7T„ + 7„_l7T„_l + 7„-27Tri-2- 

Clearly 7„ = 1, since q n ^2, K n are monic polynomials having the same degree. More- 
over, we also have 

7n-i = LiXn^^daix)/ f_ 1 9l_ 1 (x)da(x), 

= 0, 

since the integrand xtt^-i an d is an odd function, and so, by virtue of 

7„_ 2 = / xn n ^ 1 (x)Tf n ^ 2 (x)da(x)/ TT^_ 2 (x)da(x), 
the expression of q n ^ becomes 



(5.5) q° l2 := q n , 2 = n n + I / X7r n ^i(x)7r n ^ 2 (x)da(x)/ 9 n _ 2 (x)dd(x) I 7r„_ 2 - 

Thus, the remaining free parameter d n in (5.3) is equal to 

(5.6) d° n := d„ = 7„_ 2 - 

Having determined the polynomial q n .2, we now debate the principal topic of this 
case, that of determining the nodes which occur in (5.1). So we need investigate 
the zeros of q n ,2- For this, we shall now discuss the determination of the remaining 
parameter in the Jacobi matrix J°(/ ) := J n (I ). Our solution is based on 
the fact that the polynomial q n> 2 admits the following matrix representation (see 
Theorem 7) 

«2,n(a;) = det(a;I„ - J° (J J), 
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where 



(5-7) J°(/, Bi2 







Pn-2 




Pn-2 




&.-1 - d° n 








- d° 



Thus, it follows from (5.7) that the unknown coefficient fe„_i in the Jacobi matrix 
J n(I qn2 ) is given by 



(5.8) 



bn-l — Pn-1 



dZ. 



Therefore, in this particular case, the nodes xi^ n of (5.1) can be computed con- 
veniently as the well-studied ordinary Gaussian quadrature formulae. 

It remains to calculate the weights A/ jM in (5.1). To do this, we suppose that the 
eigenvectors of J° (/ ) arc computed such that 



J°(/, na )V,, n = a;,, n V, >n) 



/ = l,...,n, 



with V^ n V; „ = 1 and Vf n — (vi,/, n , •••, v n j, n ). Then, as for the ordinary Gauss 
quadrature formula, it follows from Theorem 4 that the coefficients A; ]n are ex- 
pressible in terms of the first components v\ t i_ n of V/ jn by 



A, 



•%/, 



da(x), I = 1, ..., n and Z is such that xi_ n =/= 0. 



We finally have that the weights 0J2, n an d W2, n (0) (the weight relative to the 
node 0) in (5.1), in turn is expressible in term of the orthogonal polynomial 7r„_i 
as 

J_ 1 x 2 9„-i(x)da(x) 



U2,: 



Wn(0) = 2 



27r„_i(0) 
f_ 1 7r n _i(x)do-(or) 7r^_i(0) 



W2,n- 



J 1 _ 1 x 2 n n - 1 {x)da{x) 27r n _i(0)^ 

It should be noted that in the present case the nonzero nodes in (5.1) coincide 
with the nonzero nodes in the quadrature formula, 

/l 3 n— 1 

f(x)da(x) = y^a;j,»/ ' (0) + ^ h,nf{xi,n), V/ € 7> 2 n+i> 

1 i=0 Z=l 

discussed by Stancu and Stroud [41, p. 387], since the coefficients of the odd 
derivatives /^(O) and / (0) are equal to zero. 

5.2. Case II: n is even. In the present case, which is treated differently, we 
proceed in the following manner. Our first goal is to construct a system of nonlinear 
equations implementing conditions (3.21) of Theorem 5 for the unknown coefficient 
d n of the nodes polynomial q n 2 (5.3). Thus, at first, we obtain from (3.14) that 

l^n+3 = spa«{*o,*i}, 
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where Wo and ^1 are two polynomials of exact degree n + 3, which satisfy the 
interpolation conditions 



o{xi,n) = 


= *g(0) = 


= *S'(0) = 


= o, 


1 = 1,. 


.., n 


l{xi,n) = 


= *i'(0) = 


= *i"(0) = 


= o, 


1 = 1,. 


..,n 



Here, we are forced to look for an appropriate base for the space /C n +3 in a way 
which will make these conditions simple to determine the parameter d n in q n .2- To 
do so, we first note that from the definition of JC n +3 its elements also vanish at the 
nodes of (5.1). Moreover, since n is assumed even it follows that q n ^ is an even 
function, it can then be easily verified that the polynomials 

(5-9) Mx) = (x 3 + x 2 - 2q ^l) qn , 2 (x), 

and 

(5-10) * 1 (x) = (x*-x 2 + 2 %^j§) qn , 2 (x), 

form a basis for /C„ + 3. We do not present the coefficient of the elements of {\Po, ^1} 
in term of the standard power expansion, but in place we express them in term of 
the nodes polynomial q n ,2i such a representation is the most convenient form for 
computation, since it unveils important relationships between the expansion coeffi- 
cients and the nodes polynomial q n ,2- This is the basic idea behind the construction 
of (5.1) in the present case. 

On the other hand, according to Theorem 5, the polynomials "Jo and Wi defined 
in (5.9) and (5.10) must also satisfy the orthogonality relations 
i ,i 

i J-i 

Thus, by symmetry and taking into account that q n _2 is an even function, the 
remaining unknown parameter d n in (5.3) is then a solution of 

, 5 n) j 1 _ l x 2 g na {x)da{x) = 2g», 2 (0) 

J\q n ,2(x)da(x) q'LM ' 

Since q n ^. — Tt n + d n n n -2, using an elementary manipulation one may show that the 
last equation can be re-expressed in the interesting and simpler form of a quadratic 
polynomial, 

(5.12) ap 2 + bp + c = 0. 

Of course, the remaining unknown coefficient d n in q n _2 will be certainly found 
amongst roots of the (5.12). Then, the next important issue that we face here is 
how to assess the exact value of d n . Now, by Theorem 8 the only solution d e n := d n 
of (5.12) such that 

d e n < min • 

guarantees that the polynomial, 

(5.13) q e n 2 '■= <ln,2 = n n + d e n TT ri -2, 

has n distinct real zeros all located in the open interval (— 1, 1) 



7Tn(-l) 7Tn(l) 

T n _ 2 (-1)' 7T„_2(1) 
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We now turn to the problem of determining the Jacobi matrix J^(I ) := 
3 n (I ). As before, the following construction provides a tool for finding its re- 
maining unknown coefficient fe n _i, once the parameter d n in (5.3) is obtained. We 
may rewrite, using Theorem 7, the polynomial q n 2 in matrix notation as follows 



9n,2(a;) = det(a:I n - J*(J )), 



where 
(5.14) 



Jn(I) 







Pn-2 





Pn-2 








Pn-1 



Pn-1 - d% 





a fact which allows us to compute the nodes x^ n and their respective weights w;.„ 
in (5.1) by standard techniques via an cigensystem problem for the (symmetric, 
tridiagonal) Jacobi matrix 3 e n (I ). 

Finally, as above, a simple calculation shows that an explicit formula for the 
weight L02. n in (5-1) is 



<*>2,'. 



I-l x2 €,2( X ) d,J ( X ) 

2C 2 (0) 



An alternative approach to constructing (5.1) when n is odd, without the use 
of Theorem 6, would be to proceed analogously as in the present case. Indeed, 
to adapt the latter to the odd case, we just have to find a more workable basis of 
/C ra +3. We can do even better by observing that, when n is odd the polynomials 



V (x) = (x 3 + l)q„, 2 (x) 



and 



*i(x) = (x 3 - l)q n , 2 {x), 

form a basis for IC n +3- Thus, in view of (3.21) and the parity of q n ,2, the resulting 
quasi-orthogonal polynomial q n ^ turns out to be identical to the one given by (5.5), 
so the argument as in the odd case can be carried over verbatim to construct (5.1). 

One particularly interesting point to note from either J°(i ) or J^j(I ) is 
that their structures are unexpectedly simple. Since, in each case, to evaluate 
them, it suffices to take the same set of recurrence coefficients as when computing 
the Gauss quadrature formula G(n;da), except that the last coefficient (3 n -i is 
modified. The rest of computation proceeds exactly as usual. This permits a more 
straightforward implementation and simplifies coding. 

We summarize this construction process in the following theorem. 

Theorem 9. Given a nonnegative even measure da and nonnegative integers n. 
Define do — x i da. Suppose the quasi-orthogonal polynomials q° 2 > 9n 2 an d the 
matrices J° n (I ), 3 e n (J ) are given respectively as in (5.5), (5.13) and (5.7), 
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(5.14)- Then, there exists a unique quadrature formula of the form, 

/l n 

f{x)da{x) « W2,n/"(0) + Y, \nf{x hn ), 
_1 1=1 

which integrates exactly all polynomial from Vzn+i- With: 

If n is odd, the nodes Xi, n ,l — l,...,n, in (5.15) are the eigenvalues o/J°(/ ) 
and the weights W2, n and u) n (0) are given by 

f_ 1 x 2 TT n _ 1 (x)da(x) 
^ n = 2^77(0) ' 

(5 ' 16) (n . J J-i'n-iMArM ^-!(0) \ 

" n(0) =: Hz-i^-iW^w"^ 1 ^)/" 2 '"' 

7/n is e?;en ; t/ie nodes x^ n ,l = l,...,n, in (5.15) are the eigenvalues of J„(I ) 
and the weight u>2, n * s given by 

(5 ' 17) W2 ^ = Hjo) • 

In all cases, the weights Aj jn are all positive and are given by 

v 2 f 1 
A; „ = — j^" 1 / da(x), I = 1, ..., n and I is such that x;.„ ^ 0, 



.r; 



wi£/i vu t n being respectively the first components of the normalized eigenvectors of 
J°(J q ), J%(I q ) relative to the eigenvalues xi >n . 

Proof. It remains to determine the sign of o>2, n in (5.15). This can be done by 
deriving an explicit formula for 0J2,m from which its sign can be read off. To do so, 
we first define the polynomial 

, v gn,2(s) 

P2n(X) 2« 2 (0))2 + 2g rl , 2 (0)< 2 (0)- 

Therefore since p 2n € T > 2n+i an d (5.15) is exact on T^n+ii we &l so have the following 
representation 

Ld 2 ,n = / P2n(x)da(x). 

Hence, the desired result now follows immediately, by observing that 

2(<4 2 (0)) 2 + 2^(0)^(0) = 2(<4 2 (0)) 2 > 
if n is odd, and 

n 

2« 2 (0)) 2 + 2< Zn , 2 (0)< 2 (0) = -g 2 . 2 (0) £ — < 

1=1 X hn 

if n is even. D 

The following is the basis steps of the numerical algorithm for computing the 
Turan quadrature formula (5.1). 
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(1 

(2 
(3 
(4 
(5 
(6 
(7 
(8 
(9 



Generate the matrix 3 n (da). 
If (n is odd) then 

Compute d° n by (5.6) and use it to generate the matrix 3° n (I ). 
Compute the eigenvalues and first clement of the eigenvectors of J° (da). 
Compute the coefficients t02, n and co n (0) via a direct implementation of (5.16). 
else 

Compute d e n by (5.12) and use it to generate the matrix 3 e n (I ). 
Compute the eigenvalues and first clement the eigenvectors of 3 e n (I ). 
Compute the coefficient LU2,n via a direct implementation of (5.17). 



Of course, in steps (5) and (7), we use the standard software for classical Gauss- 
ian quadrature formulae. For a general view, see the recent software package of 
Gautschi [19]. 

For a numerical point of view, two types of difficulties still remain about the 
coefficients necessary for efficient implementation: 

a) The step (1) requires the coefficients j3k,k — l,...,n — 1. How to evaluate 
them? 

b) The principal difficulty with the steps (3), (5), (7) and (9) lies in the integrals 
needed to evaluate the exact root d n of (5.12) and the weights w 2 ,n7 uj n (0). How to 
efficiently compute them? 

We give a satisfactory answer to these questions as we will explain below. In 
the classical cases, the coefficients j3k (of the original Jacobi matrix 3 n (da)) are 
known explicitly [42] or are easily derivable from standard results; in others, the 
software of Gautschi [19] computes them as a preliminary step. For the integrals 
which are contained in the coefficients d° n , d e n , u>2,n and w„(0) they can, in general, 
not be computed explicitly, to circumvent this difficulty, one judiciously has to 
use an appropriate quadrature formula to approximate them. We remark that, 
since the integrands are polynomials, we can evaluate these parameters efficiently 
to essentially any desired degree of accuracy. However, we shall indicate for the 
Legendre case how such integrals may be determined explicitly. This issue will 
be investigated by means of some properties of Jacobi polynomials in the next 
subsection. 



5.2.1. Legendre case (da(x) — dx). In the present case the task is easy, since we 
shall show that the required coefficients of 3 n (I ) may be obtained explicitly 
from 3 n (da). Theorem 10 below describes the calculation of the matrix 3 n (I ). 
For convenience, we first present five technical Lemmas which give us some basic 
results that will be needed to obtain explicit expressions for (5.1). The proofs of 
these Lemmas are deferred to the Appendix A in the end of this paper. 

To commence our nalysis, we note, first for all, since da(x) — dx, that da(x) = 
x dx. Thus, in the present case, the orthogonal polynomials {7?n} can be explic- 
itly written down. In fact, they can be reduced to Jacobi orthogonal polynomials 

\pn \ with parameters a, (3, (see Szego [42, formula 4.1.6]): 
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2I/ + 3/2 V L(0,3/2) (2a;2 _ 1); ifn = 2i/j 



(5.18) tt„(x) = < 



For convenience, we let 



2j/ + 5/2 \ I fo,5/2„ 0i „ 



;/ 



xpiT' 0/ ZJ (2a; 2 -l), ifn = 2z/ + l. 



(5.19) ^{(^/^"jand^)^ 2 *-^ 

This representation is the key tool for the analysis we undertake. As an useful 
consequence, this additional information allows us, for example, to derive explicit 
formulae for the elements of the matrix 3 n (da). We have, in fact, 

Lemma 2. The elements /3fe,fc = 1, ...,n — 1, on the side diagonals of the Jacobi 
matrix J n (da) are given by 



(5.20) /3 fc = s ,o„^2 



4v 2 /((4v + 3)(4v + 5)), ifk = 2is, 

(2v + 5) 2 /((4i/ + 5)(4v + 7)), i/n = 2v + 1. 



The next Lemma shows that, when n is odd the nodes polynomial q° 2 (5-5) is 
simply related to the (monic) Jacobi polynomials. 

Lemma 3. Let n be odd (n = 2v + 1) . Suppose c 2 and fii v are given respectively 
as in (5.19) and (5.20). Then the nodes polynomial <7„, 2 of (5.1) has the following 
representation 

(5.21) g° i2 (aO = c 2 ( V )xp^ 2 \2x 2 - 1) + ^c 2 (^ - l)xp^\ /2) (2x 2 - 1). 

While expressions for the weights toi.n and w n (0) in (5.1) have already been 
given in (5.16) and (5.17), for general measure, alternative, more explicit, formulae 
can be obtained for Legendre measure. Those for the odd case, given in the next 
Lemma, arc particularly simple. 

Lemma 4. Let n be odd (n = 2v + 1) . Then for the Legendre measure the quad- 
rature weights LU 2 , n an d the one w„(0) relative the term /(0) in (5.1) are given 
by 

3tt f I > + 1) \ 2 



(5.22) W2 '" 16ll> + 5/2)i 

6(5+10^ + 4l 2 ) 

W„(0) = W 2 ,n- 

5 

The positivity of the weight oj 2 .„ is evident from (5.22). For the case n even, to 
simplify the computation of the elements of 3 n (I ), the next result identifies the 
nodes polynomial q e n 2 . 

Lemma 5. Let n be even (n = 2v) . Define 

v{2> + 2i/)(5 - 3\/5 + (12 - 4\/5> + 8^ 2 ) 



d n = 



(-1 + 2^ + 4^ 2 )(3 + 16zv + 16^ 2 ) 
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Suppose c\ are given as in (5.19). Then the nodes polynomial q n ,2 of (5.1) has the 
following representation 

(5.23) <£ i2 (aO = ci(i/)p<°' 3 / 2 >(2* 2 - 1) + d n c x {y - l)p<°_l 3 / 2) (2s 2 - 1). 

For the even case, the weight LU2,n corresponding the term /"(0) in (5.1) could 
also be expressed in term of n alone, as shown in he following Lemma. 

Lemma 6. Let n be even (n = 2v) . Then for the Legendre measure the quadrature 
weight LU2.n in (5-1) is given by 

V5tt 3(3 + 2^) f T(z/+1) I 2 
W2 '" ~ "(1 + a/5) 2 8^ \i> + 5/2)J 

We are now in condition to give the principal numerical tool for obtaining the 
quadrature formula (5.1). This result is an immediate consequence of Lemmas 2, 
3, 4, 5, 6 and Theorem 9. 

Theorem 10. Let J n be the (n)—th order tridiagonal matrix defined by on the 
main diagonal and \fWk on the side diagonals, with 

* _ f 4^/((4*/ + 3)(4z/ + 5)), ifk = 2v, 

Pk -\ (2i/ + 3) 2 /((4i/ + 1)(4i/ + 3)), ifk = 2u-l. ' K - i '-' n z ' 

and 

{0, ifn=2m+ 1, 

(4m+l)(4m + 3) "' ; ' 

where 

_ to(3 + 2m) (5 - 3\/5 + (12 - 4\/5)ra + 8m 2 ) 
" ~ (-1 + 2m + 4to 2 )(3 + 16m + 16m 2 ) ' 

Then, there exists a unique quadrature formula of the form, 

(5-24) f f(x)dx = ^ 2 L „/"(0) + J2 *Lf(xL)y.f G *Wi, 

■'- 1 z=i 

where xf n , i — 1, ..., n, are i/ie n distinct real eigenvalues of J n and the respective 
weights are given by 

9 v 2 

•\ = x 2 "^ ' = 1) ••■j n anc ^ ^ * s su c/i i/iai x; „ ^ 0, 

5 <„ 

wzi/i Vi j n demj i/ie first components of the normalized eigenvectors of 3 n corre- 
sponding to the eigenvalues x^ n . The weights w^n an< ^ ^(C) (the weight relative 
the term f(0)) are given by 

3n f rjm+1) I 2 .. _ J _ 1 

: — > . if n = 2m, + 1 , 



L i 16 [L(m + 5/2) J ' 

; ^ n * V5tt 3(3 + 2m) f L(m+1) ) 2 



(1 + ^) 2 8m \L(m + 5/2)/ ^ ~ 2m ' 

L 9^(5 + 10m + 4m 2 ) f L(m + 1) | 2 
A " (0) = 40 \r(m + 5/2)/ ' ^ = 2w + L 
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This quadrature formula enjoys perfect symmetry; indeed we can prove easily 
that 

x n+l — i,n ' x i,n "> "n+1— i,n \,ni l •*■) •*•! n - 

The construction of quadrature formulae of type (5.1), for a more general weight 
function, in particular, for Jacobi weight function presents no extra difficulties. The 
latter can be done in precisely the same way that these results were established for 
the Gauss-Legendre case. We therefore omit this development here. 

5.3. Comparative numerical results. In this section, we compare the quadra- 
ture formula (BL„) 

/l n 

f(x)dx = Loij"{0) + Y, A,V(^n). V/ G TWi, 
_1 i=i 

developed in Theorem 10 and the classical Gauss-Legendre formula (GL„ +1 ) 

/l n+l 

f{x)dx = J2 k,nf{xt,n),Vf G V2n+1- 
~ l 1=1 

Note that the quadrature formulae (5.25) and (5.26) use the same number of eval- 
uations of integrand and have the same MPDE. For illustration, we experiment 
with numerical examples which involve three different kinds of integration prob- 
lems. They differ mostly in the specific properties of the integrand functions. The 
first one is a parametrized family of functions which have a peak at a point near 
the point with a severity that is controlled by a parameter. In the second ex- 
ample, in another context, we will choose a whole family of integrands which have 
a logarithmic endpoint singularity, and then we conclude with a final example in 
which the oscillations increase. The three examples are as follows: 

h(v) = J\ 3v2+( l v+xr dx, (v>0) 
arctan(^) arctan(^f) 



V3v V3 



r 



h(v) = U ^+ x) 7 dx, w e(o,i)>o, 

16 log(2) 16 log(2) 

3-8w- 16 v 2 ~ 5-24v + 16i; 2 
16 1og(5-4w) 16 1og(3 + 4u) 



2 IT 



Izijn) — I x cos(50x) sm{mx)dx = omn -, (to ^ 50), 



5-24w+16w 2 5-24u + 16i; 2 
2to7t 
2500 - m 2 

All the computations described in this paper were carried out on a personal IBM 
computer in a double precision. 

Example 1. The integrands here are the functions f v (x) = tttz Z ^ ) ^ > 0, 

1 6a; (— 2 v + x) 
which satisfy fy(0) = -r-^; and fy(x) = , . It should be noted 

Sv ' ' (3v 2 + (-v + x) 2 ^j 

1 
Av 2 

over (— 1, 0) , and concave downward over (0, 2v)). It can also be seen that, when v is 
sufficiently small, this function exhibits a peak near x = 0, with increasing steepness 



that the graph of f v has the inflection point 0, — ~ J , (this graph is concave upward 
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m 


n 


err. GL n+1 


err. BL n 




130 


8.9(-9) 


-1.0(-8) 




180 


2. 2 (-12) 


1.2(-12) 


1 


240 


-6.1 (-14) 


6.3(-14) 




300 


-l.O(-U) 


6.3(-14) 




130 


2.0(-3) 


-1.7(-3) 




180 


-2.1 (-5) 


2.1 (-5) 


2 


240 


1.3 (-7) 


-l.l(-7) 




300 


-l.l(-lO) 


5. 5 (-10) 




130 


-1.7 


1.0 




180 


2.0(-2) 


2. 3 (-2) 


4 


240 


1.2(-2) 


-8. 3 (-3) 




300 


l.l(-4) 


-2.1 (-4) 



Table 1. Numerical results of I\(v) and comparison with Gauss 
quadrature. 



as v tends to 0. One would expect that the difficulty of the integrand f depends 
heavily on the selection of the parameter v, which is equal to the location of the peak. 
For this example, since f"(0) — 0, it seems reasonable, therefore, to compare the 
quadrature formula (BL n ) with the ordinary Gaussian quadrature (GL n ) having the 
same number n of interior nodes. We did our computation for v = v m = 1/10 , 
m = 1,...,6 and n = 130,180,240,300, but in Table 1 shown only select results. 
Also shown in the last two columns are the respective integration errors. It can be 
seen Table 1 that the new quadrature formula produces more accurate result than 
those furnished by the ordinary Gauss quadrature formula, especially for v close 
0. With decreasing v, ordinary Gauss-Legendre is seen to converge rather slowly. 
This weakness is accentuated when the peak is moderately high. However, for small 
to, both quadratures produce results which are comparable in accuracy, the formula 
(GL) being somewhat more accurate than (BL) for small values n, the latter for 
larger values of n. 



Example 2. The integrand here is a function having a logarithmic endpoint sin- 
gularity of the type f v (x) — ;>• Thus, /„(0) = 0, fy(0) = and 

(l+(-\+v)x) 
/"(0) = — Av. We therefore compare the results of (5.25), at the same cost, by the 
ordinary (n + 1) — Gauss quadrature formula (5.26). The results are shown in Ta- 
ble 2 for selected values of v = l/(100j) and n. It can be seen that both formulae 
converge slowly, with (BL) having a slight edge on (GL), particulary when v is very 
close to 0. 



Example 3. As suggested in the introdu cation, we can compare the results of the 
composite quadrature formula, 



CBL n (f) = J2J2( l 

»=i j=i 



BL fiif 

j,Si 



f"(0)+wffj(xf^ Si )), 
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3 


n 


err. GL n+1 


err. BL n 


1 


50 

150 

250 


3.0(-4) 
3.6(-5) 
1.2(-5) 


3.2(-4) 
3.6(-5) 
1.3 (-5) 


2 


50 

150 

250 


3. 3 (-4) 
3.8(-5) 
1.3(-6) 


3.4(-4) 
3.8(-5) 
1.3(-6) 


3 


50 

150 

250 


3.6(-4) 
4.1 (-5) 
1.5 (-5) 


3.1 '(-4) 

3. 2 (-6) 
1.2(-7) 



Table 2. Numerical results of ^(w) and comparison with Gauss 
quadrature. 



200 
250 



err. GL n+ i err. BL n 



1.02(-7) 
2.18(-12) 



9. 9 (-8) 
3. 581 (-13) 



Table 3. Numerical results of I\{v) and comparison with Gauss 
quadrature. 



n 

200 



err. GL n 
1.17(-6) 



err. BL n 
8.93 (-7) 



Table 4. Numerical results of I\{v) and comparison with Gauss 
quadrature. 



with the standard composite quadrature formula 



r+l Si + 1 



CGL n+1 (f) = ^ ^ 



w 



GL 



J\ x j,i,s) 



i=l j = l 



relative to the same subdivision [—1,1] = 1J [£i-ij£i] • We illustrate these for- 

»=i 
mulae by computing again the integrals Ii(v),i — 1,2. We will refer to the number 

r 

used by each quadrature as n = ^2 Sj. For Ii(v), we did our computation for 

i=\ 

v = 1/lOV = 3, si = 50, s 2 = 100,150, s 3 = 50 and therefore n = 200,250. It 
seems reasonable to employ the quadrature formulae CBL n and CGL n+ i choosing 
the nodes Q,i = 1, ...,r, near 0. Table 3 shows the respective integration errors for 
Co = -1, Ci = -0.2, C2 = 0.2 and & = 1. 

Table 3 shows the error of the s-point approximations Ii(y), for v = 1/300, r = 3, 
Si = 90, S2 = 70 , S3 = 40 and therefore n = 200. 

These examples demonstrate the superiority of CBL n over CGL n+ \. However, 
both quadrature formulae are even better than BL n and GL n+ \. For example, if 
72(1/10 4 ) is approximated by BL n or GL n+ \ with n = 300, they produce only 4 
correct decimal digits, which is several orders of magnitude larger that BL n and 
GL n+ \. 
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6. Applications to polynomial inequalities 

In this last section we apply the results of the previous sections to derive and 
calculate, in a simple manner, new explicit sharp weighted L 2 inequalities for coeffi- 
cients of polynomials. They are all obtained as direct applications of the quadrature 
formula 



(6.1) 



1 m n— m 



l=i 



^l,m,nf{%l,m,n), V/ € V<l n +\ 



We point out that the nodes of such quadrature formula arc the zeros of the 



orthogonal polynomials associated with respect to the measure da 



J2m 



+2 da(x). 



This result can be obtained as an immediate consequence of Theorem 5. 
Problem 3. Letp(x) = aa+diX+...+a n x n be a polynomial of degree n. We assume 



that \\p\\m,2= J-l : 



aQ+aix+...+a n x n be a polynomial of 

2 v 1/2 



dx n 



p(x) > da(x) is given as a measure of the "size' 



of p. We now ask the question, how large the coefficient \a m \ can be? Below, we 
answer this question. We will calculate explicitly the constants that appear in our 
estimations. These are derived using essentially the quadrature formula (6.1). 

Before stating the particular Legendre case, we prove a more general theorem 
which leads to the best possible estimation for \a m \ in terms of ||p|| m 2 • 



Theorem 11. Let p(x) = ^ a,iX l be a polynomial of degree n. Then for any fixed 

i=Q 

k, < m < n, such that n — m is even, we have 



(6.2) 



< 



L n - 



f_j x 2m L n - m (x)da(x) 



lm,2 ' 



where L n ^ m (x) — Y\i=i ( x ~ x i,m,n), with xi_ m ^ ni l = l,...,n, being the nodes of 

d m 
the quadrature formula (6.1). The equality hods if and only if - — p{x) = const 



dx r> 



L r . 



.W- 



Remark 3. An equivalent expression of (6.2) is 



Ln- 



It, 



.2m, 



Tr, 



m,2 
m (x)d0-(x) 



dx< 



Mo) 



peV n ,\\ P \\ 



More explicit estimates than those in Theorem 11 can be obtained when the 
orthogonal polynomials associated with respect to the measure da = x 2rn+2 da(x) 
can be explicitly written down. The content of the following result is to give an 
exact expression of the above constant, for simplicity, we only consider the Legendre 
case. Thus, if da(x) = dx then the above Theorem becomes the following result, 
which will be proven in the Appendix B. 
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n 

Corollary 2. Let p(x) = ^ a,iX l be a polynomial of degree n. Then for any fixed 

j=o 

k, < k < n, such that n — m = 2v , we have 



1 TO + m + 3/2) 

(m!) 2 y/m+1/2 T(m + 1/2) 



( 6 - 3 ) \ a m\ < jz^9 / ; ,„ ^ — r^r^r IML,2> 



TTie equality hods if and only if p(x) — const pi, ' m (2x 2 — 1). 

Let us recall that a number of weighted L 2 inequalities for coefficients of polyno- 
mials have been suggested in [31] and in [25]. A full history and a detailed account 
of the current state of this theory and its applications appears in the recent mono- 
graph [35]. 

7. Concluding remarks 

Our purpose has been to shown how to modify the Jacobi matrix in such a way 
that the nodes and weights of a new class of quadratures can then be evaluated 
directly by standard software, the complexity of the various calculations associated 
with computing such quadratures is significantly reduced. Particularly encouraging 
is that this matrix is easy to construct employing only elementary transformations of 
the recurrence coefficients which characterize the original measure. This approach 
extends the well-known method for ordinary Gaussian quadrature formulae. The 
formulae derived in this way have been implemented and some numerical tests per- 
formed with the latter show a improvement over the classical Gaussian quadrature 
formulae. We have also exploited the connection between quadrature formulae and 
polynomials inequalities to apply the new class of quadratures for the latter to 
derive new explicit sharp inequalities for polynomials. 

For sake of simplicity only even measure has been considered, but the method 
developed here can be applied to arbitrary measure with slight technical modifica- 
tions. It can also be extended to the construction of quadratures with more general 
interior conditions, such as the following class 

k n 

Q(f) =^2 U >J,nCj(f) + ^2 Knf{Xi,n), 
j=l t=l 

where Cj, j = 1, ..., k, are given more general linear functionals of the form 
C«(/) := Yl <W (m) (0), 1 = 1,..,*. 

m— 

The analysis leading to these new quadrature formulae takes place most natu- 
rally on more general domains, for example, on semi-finite or infinite with adapted 
interior conditions. This problem requires different approaches, to which we hope 
to return. 

Finally, the possibly most important merit of this class of quadratures is that 
they can be useful in the numerical solution by spectral approximation of partial 
differential equations, involving interior conditions, as has been successfully applied 
in the author's joint work [17], where efficient methods based on a new family of 
quadrature formulae, with complex boundary conditions terms, were proposed. In 
that paper, the appending of additional terms in the quadratures was motived by 
the well known lumped mass spectral approximation problem. 
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Wc have also apply these quadratures to derive and calculate new explicit sharp 
weighted L 2 inequalities for coefficients of polynomials. 

Appendix A: Proofs of Lemmas. In this Appendix, we provide the proofs 
of Lemmas 2, 3, 4, 5 and 6. The proofs involve substantial use of the properties of 
Jacobi polynomials, these properties are first summarized. They are included here 
to facilitate verification by the interested reader. All the results are standard and 
can be found in books such that [42, 3]. 

We start with some basic notations and known results. For any real number a 
and integer n, the Prochhammer's symbol (a) n is defined by 

(a )„ = a(a + 1 ... a + n - 1) = y '- , 

r(o) 

where T is the Gamma function. With the weight function 

da a ^{x) = {l-x) a {l + xf on (-1,1), (a,0>-l), 

we associate the scalar product 

(7.1) {4>^) a ,, := f HxMx)da a ^(x), 

which is defined on the space of all functions whose square is integrable with respect 
to the measure da a " . There exists a sequence of polynomials called Jacobi poly- 
nomials of parameters a and 0, which arc orthogonal w. r. t. the scalar product 
(., .) a fi . As in [42], we shall use for these polynomials, the ( unusual) notation Pn 
and normalization 

(7-2) ^ ) (l)= (aH ^. 

Their orthogonality relation is [42, Formula 4.3.3] 

(7 3 s ! (r,( a >P) r) ( a -f i )\ — h X 

\'-°) \P n ifra ) a ,f3 — ""n u nm,t 

where 

2 a+l3+1 T(n + a + l)T(n + + 1) 



h n = 



(2n + a + 0+ l)T(n + l)T(n + a + + 1) ' 

and S nm is the Kronecker delta. 

They have the explicit representation (see [42, Formula 4.3.2]) 

^\*)-±(iti)( n y)(^j(^ 

Note that 

(7.4) p^'P'tx) = : -x n + lower order terms. 

2™n! 

This gives us that the associated (monic) orthogonal polynomials are given by 

[n + a + + l) n 
Further we have 

(7.5) p^\-x) = (-l) n £' a \x). 
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As it is well known for Jacobi polynomials the derivatives of arbitrary order are 
also Jacobi polynomials. This is easily derived from general properties of the hy- 
pergeometric functions F(a; b; c; z) [42]. For instance, we have 



P 
Since 



£*'^(1 - 2z) = ( a+ p n F(-n, n + a + + l;a + l;z). 



—F(a;b;c;z) = —F(a+l;b+l;c+l;z), 

dz c 



it follows the simple differentiation formula 

± p ^) {x ) = 1 - {n + a + P+l)pt\ 1 '" +1 \x), 

or more generally for k < n 

(1 *\ dk „(«./») m _ 1 T(n + k + a + (3+l) {a+kJi+k) 

( ' dx* Pn [X) ~ 2* T(n + a + (3+l) Pn ~ k W ' 

Finally, we will need to use heavily the following formulae, sec [42, Formula 4.5.4, 
p. 72], [3, Formula 4, p. 263] and [3, Formula 5, p. 263]. 

(7.7) 

(1 + x) P^ 0+1 \x) = 2n + a 2 +f3 + 2 ((n + P+ l)pi a ' \x) + (n + l)p%?(x)) , 

(7.8) f ^*)^) = 2 -^^^ 

J_i nil (a — p)L \p + p + n + 2) 

and 

( , 9 , i:A^M^^-- ^ T ^::zt:r ] - 

Now we prove the technical results needed in the proof of Theorem 10. We begin 
with the proof of Lemma 2. 



Proof. Let first k be even (k = 2v). Then, by (4.3) and (5.18) we have 
~ _ c 2 H /V{^ 3/2) (2z 2 -l)} 2 rfx 
c2 ^ f^x* {p^/ 2 \2xi -I)}* dx 
cl{v) tix±{p^\2x*-l)} 2 dx 

cl(y) 5lxt{ptT\^ 2 -v} 2 dx 

1c{{v) fh 1 (l+uf /2 {pi at3/2) (u)} 2 du 

c\(y) S^^l+u)^{ P i 1 rku)} 2 du , 

where in the last equality, we have used the change of variable u = 2x 2 — 1. Since, 
from (7.3) we have 



I , .9 2 5 / 2 



(l + U ) 3/2 {pi°' 3/2) («)} du 



(2i/ + 5/2) : 



and 



,( 1+ ») S ' ! K*'W} *= C*W 
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On the other hand, we have 
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n (2^/ + 3/2)^(1/) 
/fe (2r/ + 5/2)c|(i/)' 



d(v) 



c 2 (y) (2i/ + 3/2) 

Inserted in (7.10), this yields the first relation in (5.20). The case k even is proved 

similarly. □ 

We add a remark concerning more general nonncgativc even measure da: In the 
present case, one may verify directly that the polynomial X7r„_i (= q° 2 ) has the 
matrix representation X7r„_i(x) = dct(x/ n — J„), where 















n - 







Pn-l 




A trivial consequence, which, by applying Theorem 7, could also be used to show 
(5.21), is that d n = j3 n -i- 

Remark 4. It is remarkable that, taking into account that we also have q° 2 {x, da) = 
XTT n -i(x, da), V da, the particular and convenient above structure of the matrix J n 
remains valid more generally, for any nonnegative even measure da. 

We now proceed to the proof of Lemma 3. 

Proof. We need only compute d° n — J X7f n -i(x)ir n - 2 (x)da(x) / f_. TT^ l _ 2 {x)da{x) , 
since, from (5.5) and (5.18), we have 

< 2 (z) = c 2 (v)xp^(2x 2 1) + d° n c 2 {v l)xptT\^ 2 I)- 
For this, note first that 



dl = 



c\(v 



(7.11) 



c 2 (y 



c 2 (y 



c 2 {v 



Hy{ P ^' 2 \2x^-l)ptT\^-l)} 



dx 



S-iX 6 {p { °lT\^ 2 -l)} dx 
Ji x± {^°- 3/2) (2x 2 - l)ptT\2x 2 ~l)}dx 



dx 



So* e {ptr\^-D}< 

SUl + u)^\p^' 2 \u)ptT\u)}du 



S\{i + ufn{ P tT\u)\ 



da 



From(7.3) we have 
(7.12) 



2 7/2 



i (i+«r{rf« w w} *= j^j 



163 



GAUSSIAN QUADRATURES OF BIRKHOFF TYPE 

The formula (7.7), with a = 0,(3 = 3/2 and n = v — 1, yields 

(i + u)vtr\u) - ^1^ o + m P tr\u) + ^°< 3/2) (*)) 

Therefore, one finds that 

1 (1 + uf' 2 {p^ 3/2) (u)p^\ /2 \u)} du = e[\l + uf' 2 {p(,°' 3/2) («)} 



2 



du, 



where e = Trio' Using this, and (7.3) with a = 0,/3 = 3/2 and n = v, gives 

1 (1 + U ) 5 / 2 U°' 3/2) («)p^ 5 / 2) (w)i d« = 7 rZ rr2 7/2 - 

.1 ; V" V ^" _1 V ; I (2z/ + 3/2)(2z/ + 5/2) 

This together with (7.12) gives 

/^(H-^^^/ 2 )^)} 2 ^ "(2^ + 5/2)- 

Inserted in (7.11), and also using the fact that — —^ = - r — , this proves the 

ci\y) (2^ + 3/2) 

result. □ 

We pass now the proof of Lemma 4. 
Proof. It follows easily from (5.16) and (5.18) that 

_^2 f\VTTTt{ P { °' 3/2) (u)}du 
W2 '"~^ {p?- 3/2) (-D} ' 

This together with (7.5) and by a change of variable gives 

8 {^ 3/ ' 0) (l)} 

Since, from (7.2), we have 

(7.14) p^°\l) = r ^ + 5 / 2 ) 



r(i/ + i)r(5/2)' 

it remains to evaluate the integral in the numerator of (7.13). To do this, once 
again, we appeal to the identity (7.8), with p — 1/2, a — 3/2 and (3 = 0, to obtain 

(7.15) |' VT^{p^°\u)} du = 2 3/2 r ^ + + 5 ^ r(3/2). 

Finally, combining (7.15), (7.14), (7.13) and using the fact that {r(3/2)r(5/2)} = 
^ yields (5.22). In order to prove the explicit expression for w„(0) we have to use 
an argument similar the one above. This will be omitted here. □ 

Next we prove Lemma 5. 
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Proof. For simplicity, wc refer to d — d n . First, (5.11) gives that d is a zero of 

J\ x 2 ir n {x)dx + dj\ x 2 9 n ^ 2 (x)dx _ 2 7r„(Q) + d7r w _ 2 (Q) 
j\9 n (x)dx + d j\n n _ 2 (x)dx ~ 5r£(0) + d7r£_ 2 (0)" 

As suggested in the remark after the equation (5.13), to establish the lemma, wc 
need to find the zero d of (7.16) with smaller magnitude. To solve equation (7.16) 
we first remark that this is equivalent to 

1 + df\ x 2 n n _ 2 (x)dx/ J\ x 2 TT n (x)dx = 2 l + d7r n _ 2 (0)/7r n (0) 
l + dj\n n _ 2 (x)dx/j\7r n (x)dx ~ / l + d7r; / _ 2 (0)/7f^0)' 

where 

, ^n_(0) /^TTnC^da; 



^"(O)/^ 2 ^)^' 

>ecause it gives a partic 
J^ L 1 a; 2 7r„_ 2 (x)da; (2 V + 1/2) (2v + 3/2) 



We chose this representation because it gives a particularly easy proof of Lemma 
5. We now show that 

(7.18) TVi := 



J_ 1 x 2 7f n (x)dx v 

This follows by observing, since n = 2v, that 

jV^-aPOds = Cl (,-l)/- 1 ^ 2 K- 3 / 2) (2^-l)}^ 

j^ ^TTn^da: Cl(l/) J^ X 2 j^ ' 3 / 2 )^ 2 - 1) } dx 

Cl ( v -i)ti- 2 {ptT\^ 2 -i)}dx 

c i(") J^x 2 {p { J > ' 3/2 \2x 2 -l)}dx 
ci (1/ - 1) /-i VI -" {pL-i' 0) ( m ) } dw 

c iW ^VT^j^V)}^' 

The result now follows by using (7.15). 

To evaluate J_ 1 7r n - 2 (x)dx/ J_ 1 7r n (x)dx we proceed as follows: 



Ji*n- 9 (*)(fe _ Cl {u-l)S\{ptT\^ 2 -l)} 



dx 



which, since 



Il 1 n n (x)dx cM ^ 1 {p(, 0,3/2) (2x=»-l)}(te 

= Cl{ ,-l)Io{ptT\2x 2 -l)}dx 
c iM J Q 1 {p^ 3/2) (2x 2 ~l)}dx 
ci(i/ - 1) /-i {pS-i' 0) («)/>/! - u } du 

c ^) f^j^^/vr^i}^' 
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reduces to 

-= J^n-2W& = (t;+1/2)(2u+1/2)(2u + 3/2) 
1 ' j 2 ' /^TTn^da: " v(v+l)(t; + 3/2) 

Now using (7.2), we get from (7.5) 
, 7<m n _ gn-2(0) _ (2t;+l/2)(2^ + 3/2) 

(7 - 21) Dl - = X(C = (^TW • 

To evaluate 7r"_ 2 (0)/7r"(0), we obtain from 

^) = C i(^°' 3/2) (2* 2 -l) 
by differentiation this expression twice, then set x = and using (7.6) 

^;(o) = (-ir 1 2(, + 5/2) Cl H r( " + 5/2) 

(7.22) D 2 := 



T(7/2)r(i/)' 
so that 

5^-2(0) (i/-l)(2« + l/2)(2t; + 3/2) 



<'(0) i/(v + 3/2)( V + 3/2) 

Combining (7.18), (7.20), (7.21) and (7.22) yields 

1-iVirfl -J>i<* 
1 - 7V 2 d l-D 2 d' 

where 

5(i;+1)(t; + 3/2) 

•' ~~ 2w(u + 5/2) 

Finally, one finds, after a somewhat laborious but elementary calculation, that 

, K3 + 2i/)(5 - 3\/5 + (12 - 4\/5> + 8i/ 2 ) 
~ (-1 + 2v + 4i/ 2 )(3 + 16z/ + 16t/ 2 ) ' 

The representation (5.23) now follows from (5.13). □ 

We now can prove Lemma 6. 

Proof. By (5.17), with the use of (5.23), we obtain 

1 J_ 1 x 2 9 n (x)dx + d n J_ 1 x 2 9 n - 2 (x)dx 
W2 ' n ~ 2 7r„(0)+d„7r„_ 2 (0) ' 

It turn out to be convenient for the computation to write 

1 1 + d n J_ 1 x 2 TT n -2{x)dx/ J_ 1 x 2 7f„(x)dx 
W2 '" = 2- f l + d„7r„_ 2 (0)/5r n (0) ' 

where 

J_ x x 2 TT n {x)dx 

f= MO) ' 

Since 

1 + d n J_ l x 2 TT n ^ 2 {x)dx/J_ 1 x 2 TT n (x)dx l - ]Vid n d 

l + d„7r n _ 2 (0)/7r n (0) ~ 1-A4' 

where JVi and D\ are given by (7.18) and (7.21), we then have 
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We also have 

j\x^ n (x)dx J 1 _ 1 X 2 {p^ 3/2) (2x 2 -l)}dx 

%M U°' 3/2) (-l)} 

/ ^ 2 {pl ' 3/2) (2x 2 -l)}rf.x 

{ P P /2) (-l)} 

4 {^ 3A0) (1)} 

then we use (7.15) and (7.2) to obtain 



/ = 



_ 3 j T(3/2)I>+l) | ; 



2\ I> + 5/2) J 



which, inserted in (7.23) and using the value of d n found in Lemma (5), yields after 
a simple computation the desired result. □ 

Appendix B: Proof of Theorem 11 and Corollary 2. 

The technique we use in the proofs is essentially the same as that in [8, 26]. We 
begin by the proof of Theorem 11. 

Proof. This is an immediate consequence of the quadrature formula (6.1) and the 
Cauchy-Schwarz inequality, combined with a cleaver idea that appears in [8]. By 
the quadrature formula (6.1) which is exact in V^n+i, we have for any p G V n 

x 2m < -r^P{x) \ L n _ m (x)do-(x) = (2m)! I j^p(O) > L n _ m (Q)u) m , n - 
Hence, using the fact that 

x 2m L n ^ m (x)do-(x) = (2m)!L„_ m (0)w TOi „, 
J-i 

we get 

l j\ X 2m | -^P(x) > Tfn-m{x)da{x) 

m - j\x 2m n n - m (x)do-(x) 

The theorem now follows from the Cauchy-Schwarz inequality The sharpness of 
the estimate is guaranteed by the latter. □ 

We now mention how we can prove Corollary 2 

Proof. When do~(x) — dx, using the same arguments as before, the inequality (6.3) 
is a simple combination of Theorem 11, the identities (7.8), (7.9) and the fact that 

9^ m (x)^U 2v + m v +l/2 y 1 \p^^ 2 H2x 2 -l), {n-m = 2v). 

Details are omitted for brevity. □ 



167 



GAUSSIAN QUADRATURES OF BIRKHOFF TYPE 

Acknowledgment. The author would like to acknowledge B. Bojanov and Q. I. 
Rahman for stimulating discussions. The author would like to take this opportunity 
to express his acknowledgcnt to the INTAS for the financial support ( grant INTAS- 
94-4070 of the European Union) . 



[10 

[11 

[12. 
[13 

[11 
[15. 

[16. 

[17. 

[18. 
[19 
[20 

[21 
[22 

[23 

[21 



References 

R. Askcy, Positive quadrature methods and positive polynomial sums, Approximation Theory 
5, C.K. Chui, L. L. Schumaker and J. D. Ward, eds., Academic Press, New York, 1986, 1-30. 
K. Atkinson and A. Sharma, a partial characterization of poised Hcrmitc-Birkhoff interpola- 
tion problems, Siam J. Numer. Anal. 6 (1969), 230-235 

P. Berckmann, Orthogonal polynomials for engineers and physicists, The Golem Press, 
Boulder, Colorado, 1973. 

C. Bernardi and Y. Maday, Approximations spectrales de problemes aux limites elliptiques, 
Spingcr-Vcrlag France, Paris, 1992. 

B. Bojanov B. and G. Nikolov, Comparison of Birkhoff type quadrature formulae, Math. 
Comput., 54(1990), pp. 627-648. 

B. Bojanov, G. Grozev and A. A. Zhensykbaev, Generalized Gaussian quadrature formulas 
for weak Chcbychcv systems, in Optimal Recovery of Functions, B. Bojanov and H. Wozni- 
akowski, eds., Nova Sciences, New York, 1992, pp. 115-140. 

B. Bojanov and A. Guessab, Gaussian quadrature formula of Birkhoff's type, Calcolo 
34(1997), pp. 41-50. 

B. Bojanov, An inequality of Duffin and Schacffer type, East J. Approx. 1(1995), pp. 37-46. 
B. Bojanov, Total positivity of the spline kernel and its applications, M. Gasca and C. A. 
Micchelli, eds., Total positivity and its applications, 3-34, Kluwer Academic Publishers, 1996, 
pp. 3-34. 

T. S. Chihara, An introduction to orthogonal polynomials, Math. Appl., vol. 13, Gordon & 
Breach, New York, 1978. 

Ph. J. Davis and Ph. Rabinowitz, Methods of numerical integration, Academic Press, London, 
1984. 

D. K. Dimitrov, On a problem of Turan:(0,2) quadrature formula with a high algebraic degree 
of precision, Acquationes Math. 41(1991), 230-235. 

D. K. Dimitrov, Lacunary quadrature formulae and interpolation singularity, J. Approx. 
Theory 75(1993), 237-247. 

E. A. Van Doom, Representation and bounds for zeros of orthogonal polynomials and eigen- 
values of sign-symmetric tri-diagonal matrices, J. Approx. Theory, v. 51, 1987, pp. 254-266. 
S. Elhay and J. Kautsky, IQPACK: Fortran subroutines for the weights of interpolatory 
quadratures, School of Mathematical Sciences. The Flinders University of South Australia, 
April 1985. 

A. Ezzirani, Construction de formules de quadrature pour des systemes de Chebyshev avec 
applications aux methodes spectrales, These de l'Univcrsite de Pau, France, 1996. 
A. Ezzirani and A. Guessab, A fast algorithm for Gaussian type quadrature formulae with 
mixed boundary conditions and some lumped mass spectral approximations, to appear in 
Math. Comp. (1998). 

W. Gautschi, A survey of Gauss-Christoffcl quadrature formulae, in E. B. Christoffel, P. L. 
Butzer and F. Feher, eds., Birkhauser, Basel, 1981, pp. 72-147. 

W. Gautschi, Algorithm 726: ORTHPOL — A package of routines for generating orthogonal 
polynomials and Gauss-type quadrature rules, ACM Trans. Math. Software 20(1994), 21—62. 
W. Gautschi, On the construction of Gaussian rules from modified moments. Math. Comp. 
24(1970), 245-260. 

S. Gocdcckcr, Remark on algorithms to find roots of polynomials. SIAM J. Sci. Computing 
15(1994), pp. 1059-1063. 

G. H. Golub and J. H. Welsch, Calculation of Gauss quadrature rules, Math. Comp., 23(1969), 
pp. 221-230. 

G. H. Golub, Some modified matrix eigenvalue problems, SIAM Rev., 15(1973), pp. 318-334. 
G. H. Golub and J. Kautsky, Calculation of Gauss quadratures with multiple free and fixed 
knots, Numer. Math., 41(1983), 147-163. 



168 



ALLAL GUESSAB 



[25 
[26 
[27 
[28 
[29 

[30 
[31 

[32 

[33 

[34 
[35 
[36 

[37 
[38 

[39 
[40 

[41 
[42 
[43 
[44 
[45 
[46 



A. Guessab and G. V. Milovanovic, An estimate for coefficients of polynomials in L norm, 
Proc. Amcr. Math. Soc, 120(1994), pp. 165-171. 

A. Guessab and Q. I. Rahman, Quadrature formulae and polynomial inequalities, J. Approx. 
Theory, 182(1997), pp. 255-282. 

A. Guessab, A fast algorithm to compute Gaussian quadrature formulae for spline function, 
U.A. CNRS 1204. 9(1998). 

S. Karlin and W. J. Studden, Chebyshev systems: with applications in analysis and statistics, 
John Wiley & sons, inc. 1966. 

M. G. Krein, "The ideas of P. L. Chebyshev and A. A. Markov in the theory of limiting 
values of integrals and their further developments, Uspckhi Fiz. Nauk, 1951, 3-120 (Russian); 
Amer. Math. Soc. Transl. Scr. 2, 12 (1951), pp. 1-122. 

B. N. Parlet, The symmetric eigenvalue problem, Prentice-Hall, Englewood Cliffs, 1980. 

G. Labelle, Concerning polynomials on the unit interval, Trans. Amer. Math. Soc, (1968), 
pp. 321-326. 

C. A. Micchclli and T. J. Rivlin, Numerical integration rules near Gaussian formulas, Israel 
J. Math., v. 16(1973), pp. 267-299. 

C. A. Micchelli and T. J. Rivlin, Quadrature formulae and Hcrmitc-Birkhoff interpolation, 
Advances in Mathematics, 11(1973), pp. 93-112. 

G. V. Milovanovic, Construction of s-orthogonal polynomials and Turan quadratures, in 
Approx. Theory III, Nis, 1987, G. V. Milovanovic, cd. Univ. Nis, 1988, pp. 311-328. 
G. V. Milovanovic, D. S. Mitronovic and Th. M. Rassias, Topics in polynomials: Extremal 
problems, inequalities and zeros, World Scientific Publishing, 1994. 

F. Peherstorfer, Characterization of positive quadrature formulae, SIAM J. Math. Anal., 
12(1981), pp. 935-942. 

F. Peherstorfer, Characterization of positive quadrature formulae II, Siam. J. Math. Anal., 
15(1984), pp. 1021-1030. 

F. Peherstorfer, On positive quadrature formulas, in Numerical integration IV, International 
Series of Numerical Mathematics, H. Brass and G. Hammerlin, eds., Birkhauser Verlag Basel, 
112(1993), pp. 297-313. 

A. Ronveaux and W. V. Assche, Upward extension on the Jacobi matrix for orthogonal 

polynomials, J. approx. Theory, 86(1996), 335-357. 

H. J. Schmid, A note on positive quadrature rules, Rocky Mountain J. Math., v. 19, 1989, 

pp. 395-404. 

D. D. Stancu and A. H. Stroud, Quadrature formulas with simple and multiple fixed nodes. 

Math. Comp., 17(1963), 384-394. 

G. Szego, Orthogonal Polynomials, Colloquium Publication, v. 23, 4th ed., Amer. Math. Soc, 
Providence, R. I., 1975. 

P. Turan, On some open problems of approximation theory, J. Approx. Theory 29(1980), 

23-85. 

D. S. Watkins, Some perspectives on the eigenvalue problem, Siam Review, 35(1993), pp. 

430-471. 

Y. Xu, Quasi-orthogonal polynomials, quadrature, and interpolation, J. Math. Anal. Appl., 
182(1994), pp. 779-799. 

Y. Xu, A characterization of positive quadrature formulae, Math. Comp., 62(1994), pp. 
703-718. 



Current address: Department of applied mathematics, University of Pau, 64000, Pau, France. 
E-mail address: allal.guessabOuniv-pau.fr 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS,VOL.3,NO.2,169-185,2005,COPYRIGHT 2005 EUDOXUS PRESS, LLC 

A Cyclic Subgraph Methodology for 
Estimating de Bruijn Weight Class Distributions 

Gregory L. Mayhew 

Boeing Phantom Works 

P.O. Box 516 

MC S306-5145 

St Louis, MO 63166-0516 

tel: 314-232-0667 

email: gregory.l.mayhew@boeing.com 

Abstract 

Order n de Bruijn sequences are the period 2 n binary sequences produced by an n stage feedback 
shift register. At present, determining the weight class distribution of the gen-erator functions is 
theoretically and computationally intractable for all but the smallest n. This paper combines 
graphical, combinatorial, and numerical techniques to develop a method, illustrated by n=7, that 
estimates an unknown de Bruijn weight class distribution. 

1. Introduction 

The order n de Bruijn sequences are the 2 2 ~ n period 2 n binary sequences in which every 

binary n-tuple occurs exactly once. The de Bruijn sequences are equivalent to Hamiltonian 
circuits through the de Bruijn graph. The de Bruijn sequences can be algebraically constructed 
by n- stage nonlinear feedback shift registers and categorized by the Hamming weight of the truth 
tables of the generating functions [1]. The complete weight class distributions for de Bruijn 
sequences are known for n < 6 [2]. For n > 7, the extreme weight classes can be calculated by 
counting rooted trees of the PCR and CCR decompositions of the state vector space V(n) [1]. 
Since a similar algorithm is not known yet for the interior weight classes, limited information has 
been obtained by exhaustive computer searches [2, 3]. 

The de Bruijn graph contains many cyclic subgraphs which have equivalent representations 
as short cycles in the feedback function x n © g(x n .i ... X2 xO [4, 5]. These short cycles are 
restrictions on the odd weight truth tables for any given order n. Since exhaustive testing of 
truth tables is not feasible currently for most orders n, subsets of truth table states can be tested 
for the existence of short cycles. Convolving results from disjoint subsets produces an upper 
bound on each weight class in an order n distribution. The numerical and combinatorial effects 
of these short cycle restrictions greatly improve the bound over the combinatorial bound 
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obtained by counting all the cycle decompositions possible for any odd weight truth table. The 
underlying weight class distribution can be estimated by combining the cyclic subgraph bound 
with the symmetry groups [6] in an order n and the ratio of remaining truth tables to de Bruijn 
sequences. 

2. Weight Classes 

The weight w of x n © g(x n _i ... X2 xO is the number of logical ones (Hamming weight) among 
the 2 n_1 entries in the truth table of g(x n _i ... X2 xi). Truth tables which produce de Bruijn 
sequences have odd weight between the minimum weight Z(n)-1 and the maximum weight 
2 n_1 -Z*(n)+l , inclusive [1]. Z(n) and Z*(n) are the number of cycles from the pure and 
complementing cycling registers, respectively. 

3. Combinatorial Upper Bound 

Let C(j,r) denote the binomial coefficient j choose r [7]. For odd weight feedback functions 
which produce de Bruijn sequences, the values at g(0) and g(2 n_1 -l) are 1. Each order n odd 
weight class w has C(2 n_1 -2,w-2) truth table candidates. Using order n = 7 to illustrate that this is 
a weak upper bound, C(62,w-2) > 2 for all odd w in 27 < w < 39, which means that this naive 
upper bound would allow placement of all order 7 de Bruijn sequences into any one of several 
odd weight classes. However empirical evidence indicates that all odd weight classes in the 
proper weight range are nonempty and that the overall de Bruijn weight class distribution has a 
Gaussian shape [2]. The Cyclic Subgraph Methodology uses short cycles to significantly reduce 
this combinatorial upper bound. 

4. Ratios 

Collectively for an order n > 7, the ratio of truth table candidates to de Bruijn sequences 
approaches 2 n " . At n = 7, the total number of C(2 n_1 -2,w-2) truth table candidates is 

2,305,741,1 18,628,006,748 and the total number of 2 n de Bruijn sequences is 
144,115,188,075,855,872. The order 7 candidate to sequence ratio is 15.999292992. Within the 
Cyclic Subgraph Methodology, the quality or "goodness" metric is the reduction of this 2 n " 3 ratio. 

Individually by weight class w within an order n, this 2 n ~ 3 ratio is closely approximated only 
at the middle weights 2 n " -1, 2 n ~ +1, and 2 n " 2 +3. The candidate to sequence ratio is significantly 
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higher than 2 n ~ 3 at the minimum weight Z(n)-1 and the maximum weight 2 n_1 -Z*(n)+l. 
Nevertheless, a first approximation to an order n weight class distribution is C(2 n_1 -2,w-2) / 2 n " 3 
for each valid weight w. Later, this approximation will be compared with the results from the 
Cyclic Subgraph Methodology. 

5. Cyclic Subgraphs and Short Cycles 

Most feedback functions with odd weight produce cycles with lengths k < 2 n . These short 
cycles are cyclic subgraphs of the de Bruijn graph as shown in Figures 1 and 2. Let (3(n,k) 
denote the number of distinct length k cycles producible in an n stage shift register using the 
feedback function x n © g(x n _i ... X2 xi). 

The distinct and canonical forms of the length k cycles are the cyclic equivalency classes 
(CEC) in a Bounded Synchronization Code [1]. The number of CECs is 



/3(k,k) = lY J M(d)2 k/d =jY,4'l] 2 



* d\k * d\k V" 

For each length k cycle, there exists some shift register length m that is the shortest shift 
register which can produce this particular length k cycle and guarantee each state has a unique 
successor. This value m is the minimum span of the length k cycle. A length k cycle and its 
minimum span is called an [m,k] sequence [5]. (Note m is assigned here instead of the usual n to 
distinguish the minimum span m from the order n de Bruijn shift register.) 

An [m,k] sequence is a sequence of least period k in which all successive k sets of m adjacent 
digits (called 'm windows') are distinct. If m < n, then the length k cycle is producible in an n 
stage shift register with a unique successor for each state. If m > n, then the length k cycle is not 
producible in an n stage shift register with a unique successor for each state. For example, the 
length 7 cycle (0101 1 1 1) is a [4,7] cycle but not a [3,7] cycle. As will be explained later, when 
applied to a particular order n, Cyclic Subgraph Methodology uses [m,k] sequences with m < n. 

Figure 1 illustrates the following cyclic subgraphs for an n=4 stage shift register: the short 
cycle (1) has length k=l, has minimum span m=l, and is a [1,1] sequence; the short cycle (01) 
has length k=2, has minimum span m=2, and is a [1,2] sequence; the short cycle (Oil) has 
length k=3, has minimum span m=2, and is a [2,3] sequence; the short cycle (0001) has length 
k=4, has minimum span m=3, and is a [3,4] sequence; and the short cycle (001 1 1) has length 
k=5, has minimum span m=3, and is a [3,5] sequence. 
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Figure 1 : Cycles with Lengths 1 through 5 




Figure 2: Cycle with Length 14 
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Table I 
[m,k] Sequences for k < 8 



Cycle 
length k 


Minimum 
span m 


[m,k] Sequences 


1 


1 


0, 1 


2 


1 


01 


3 


2 


001, Oil 


4 


2 


0011 




3 


0001, 0111 


5 


3 


00011, 00111 




4 


00001, 00101, 01011, 01111 


6 


3 


000111, 001011, 001101 




4 


000011, 000101, 001111, 010111 




5 


000001, 011111 


7 


3 


0001011, 0001101, 0010111, 0011101 




4 


0000101, 0000111, 0001111, 0101111 




5 


0000011, 0010011, 0011011, 0011111 




6 


0000001, 0001001, 0101011, 0110111 
0010101, 0111111 


8 


3 


00010111, 00011101 




4 


00001011, 00001101, 00001111, 00010011, 
00011001, 00101101, 00101111, 00110111, 
00111011, 00111101 




5 


00000101, 00000111, 00011011, 00011111, 
00100111, 00101011, 00110101, 01011111 




6 


00000011, 00001001, 00010101, 00111111, 
01010111, 01101111 




7 


00000001, 01011011, 00100101, 01111111 



Figure 2 illustrates the cyclic subgraph for an n=4 stage shift register with the short cycle 
(00001 101 1 1 1001) that has length k=14, has minimum span m=4, and is a [4,14] sequence. 

Table I categorizes the CECs for k < 8 into [m,k] sequences using the lexographically least 
member of each CEC. Any truth table x n © g(x n _i ... X2 xi) which produces an [m,k] sequence 
cannot produce a de Bruijn sequence. In a subsequent section, these [m,k] sequences become the 
basis of tests for length k short cycles and these tests expedite the identification of truth tables 
that cannot produce de Bruijn sequences. 
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6. Enumerating P(n,k) 

Observe P(n,k) is the total number of [m,k] sequences with length k and miniminum span 
m < n. To enumberate P(n,k), first construct all cycles with length k, second test the minimum 
span of each length k cycle, and third tally all length k cycles with m < n. 

The testing for minimum span proceeds directly from the notion of 'm windows' . Let 
(ao, ai, ..., a k _i) be a sequence having least period k from a CEC. By definition, (a , ai, ..., a k _i) is 
an [m,k] sequence only if there does not exist any i and j, < i < k, 0<j<k, for which the 
(i+l)th 'm window' equals the (i+j+l)th 'm window', where indices are computed mod k. This 
'm window' testing process is easily programmed on a computer. 

Table II summarizes the combinations of n and k at which formulas for P(n,k) are known [4, 
5]. The function (j)k, r is defined as the number of integers t < k satisfying (k,t) < r, where (k,t) 
denotes the greatest common divisor of k and t. The applicable domains for these formulas are 
illustrated in Figure 3. 

Table III extends previous results by enumerating P(n,k) in the unsolved region labelled D in 
Table II and in Figure 3. An arrow indicates that the value perpetuates in that row or column. 
When the Cyclic Subgraph Methodology is applied to a particular order n, P(n,k) represents the 
maximum number of tests for short cycles at each length k. 

7. Short Cycle Testing 

The [m,k] sequences become the basis for tests which expedite the identification of truth 
tables that cannot produce de Bruijn sequences. By this method alone, testing truth tables for 
subcycles up to one-third the sequence length is required to verify a particular truth table 
produces a de Bruijn sequence. In practice, the majority of invalid truth tables can be quickly 
eliminated with some subcycle testing and the remaining truth tables tested by attempting to 
construct a full period sequence from the particular truth table. 

Theorem: If m < n, then a cycle of length k with minimum span m can be structured into a 
logical test for an n stage shift register that eliminates truth tables which contain that particular 
length k cycle. 
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Table II 
Formulas for P(n,k) 



Region 


Domain 


P(n,k) 


A 


n>k 


P(n,k) = P(k,k) 


B 


n = k 


/3(n,k) = /3(k,k) 


C 


< s = k-n 
<l/4k+l 


t=\ ;'=0 2%<l+(s -d,-j)h 




n = k- 1 


P(n,k) = P(k,k) 




n = k-2 


P(n,k) = P(k,k)-<fi k 




n = k-3 


P(n,k) = P(k,k)-2<f> k2 +2 




n = k-4 


P(n, k) = P(k, k) - 4<f> k 3 - 2(k,2) + 10 




n = k-5 


P{n, k) = P(k, k) - 8^4 - (k,3) + 19 




n = k-6 


^(n, k) = P(k, k) -16^ j5 - 4(fc,2) - 2(fe,3) + 48 


D 




UNSOLVED 


E 


k = 2 n 


P(n,k) = 2 2 " " de Bruijn sequences 


F 


k>2 n 


y#(n,A:) = 









Shift Reg 


ister Sts 


ges 


n 








2 3 4 5 6 7 8 9 10 11 12 


2 


B 






3 


C 


B 






4 


E 




B 






5 






B 




A 


6 








B 






M 7 








B 






M 8 


E 








B 






C 

X 9 


F 








B 






3 io 








B 






U 11 








B 




12 


C 




B 


13 








14 




D 




15 








16 




E 






17 









Figure 3: Domains for P(n,k) Formulas 
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Table III 
Values of p(n,k) for n < 12 and k < 17 



k\n 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


P(k,k) 


2 


1 


1 


-> 


















1 


3 


2 


2 


-> 


















2 


4 


1 


3 


3 


-> 
















3 


5 





2 


6 


6 


-> 














6 


6 





3 


7 


9 


9 


-> 












9 


7 


I 


4 


8 


12 


18 


18 


-> 










18 


8 




2 


12 


20 


26 


30 


30 


—> 








30 


9 







14 


32 


46 


50 


56 


56 


-> 






56 


10 







17 


57 


73 


85 


95 


99 


99 


-> 




99 


11 




4 


14 


78 


124 


154 


168 


176 


186 


186 


-> 


186 


12 






13 


113 


217 


271 


309 


325 


331 


335 


335 


335 


13 






12 


154 


348 


482 


552 


590 


608 


618 


630 


630 


14 






20 


208 


574 


877 


1009 


1083 


1119 


1139 


1155 


1161 


15 






32 


300 


944 


1502 


1826 


1996 


2102 


2142 


2168 


2182 


16 






16 


406 


1528 


2638 


3370 


3718 


3894 


3986 


4038 


4080 


17 









538 


2456 


4618 


6066 


6872 


7282 


7496 


7600 


7710 



Proof: Let (ao, ai, ..., ak-i) be a sequence having least period k from a CEC. Length k 
subcycles are constructed in an order n shift register as follows. The methods are slightly 
different for k < n and n < k < 2 n , where k is the cycle length and n is the register length. 
For k < n, the fundamental sequence is extended cyclically until its new length k > n. Once n < 
k or k < 2 n , the 'n windows' test is the following. For < j < k-1 construct k 'n+1 windows', 
(aj, aj+i, ..., aj+n-i, aj+n), where indices are computed modulo k. Note that (a_j, aj+i, ..., aj +n -i) 
represents the truth table state (x n x n _i ... X2 x0 and (aj+ n ) represents the value of x n © g(x n .i ... X2 
xi) at that state. Because the bottom half of the truth table is the complement of the top half for 
a de Bruijn sequence, all conditions on x n © g(x n _i ... X2 Xi) can be translated to conditions on 
g(x n _i ... X2 Xi). A valid test exists on g(x n _i ... x 2 Xi) for that k cycle if four conditions are 
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satisfied. First, all 'n windows' are distinct because m < n. Second, if the all zeros state is part 
of the k cycle, then g(0)=l. Third, if the all ones state is part of the k cycle, then g(2 n_1 -l)=l. 
Fourth, translation of truth table state assignments from the bottom half to the top half does not 
create any conflicting assignments for g(x n _i ... X2 xi) at some state because m < n. 

As examples, consider length k cycles in a n=7 stage shift register. The length 2 cycle (01) 
exists if g(21)=l and g(42)=l. The length 3 cycle (001) exists if g(9)= 1 and g(18)=0 and 
g(36)=l. Similarly, the length 4 cycle (01 1 1) exists if g(29)=0 and g(46)=0 and g(55)=l and 
g(59)=l. Likewise, the length 9 cycle (000100101) exists if g(4)=0 and g(9)=0 and g(10)=l and 
g(17)=l and g(18)=l and g(20)=0 and g(34)=0 and g(37)=0 and g(40)=l. 

Note that different length k cycles can place different conditions on some subset of states. 
Again, as an example, consider length k cycles in a n=7 stage shift register. The length 3 cycle 
(001) requires g(9)=l and g(18)=0 whereas the length 9 cycle (000100101) requires g(9)=0 and 
g(18)=l. 

Theorem: If a feedback function g(x n _i ... X2 xO with odd weight w, Z(n)-1 < odd weight 
< 2 n " -Z*(n)+1, does not produce any length k cycle for 1 < k < 273, then g(x n _i ... X2 xO 
produces a de Bruijn sequence. 

Proof: For n > 2, the number of cycles into which the n dimensional vector space V(n) is 
decomposed by a shift register is even or odd according to whether weight w is even or odd [1]. 
All truth tables producing de Bruin sequences have odd weight. So if one cycle is not produced, 
then three is the least number of subcycles possible. Any subcycle longer than 273 also 
produces a subcycle shorter than 273, so subcycle testing up to 273 determines whether or not 
the truth table produces a period 2 n sequence. 

8. Cyclic Subgraph Methodology 

The exact and complete weight class distribution for any order n > 7 could be determined by 
producing all feedback functions g(x n _i ... X2 xO with odd weight, testing these feedback 
functions for the P(n,k) cycles with length k < 273, and then counting only those feedback 
functions which produce full period sequences. Testing for all short cycles with lengths 
2 < k < 273 requires variable access across all states in g(x n _i ... X2 xi). However, this approach 
currently is not feasible due to combinatorial size with readily available computer resources. 
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On the other hand, partitioning the states into feasible combinatorial sizes, testing these 
disjoint partitions for short cycles with lengths 2 < k < 273, and then convolving the results is an 
approach that is currently feasible. Some short cycle tests may not be included because they 
could have states in more than one partition. Also, the lengths of cycles tested could be less than 
273 because the partitions may not have sufficient states to support the longer cycles. As a 
result, this alternate approach produces an upper bound rather than counting exactly the number 
of de Bruijn sequences in any or all weight classes for that order n. 

The Cyclic Subgraph Methodology for a particular order n has these steps. 

1) Determine the cardinality s of the largest set of states for which all 2 s conditions in 
g(x n _i ... X2 xi) can be examined for length k cycles. 

2) Set length k=2. Initialize all partitions Pi to the null set of states from g(x n _i ... x 2 xi). 

3) Select an unassigned short cycle with length k from the [m,k] sequences with m < n. 
If all short cycles of length k with m < n have been considered, then increment k 
while 2 < k < 273 and return to the start of step 3. 

4) Determine the test conditions on states in function g(x n _i ... x 2 xi) for the existence this length 

k cycle. Valid conditions are produced only when the [m,k] sequence has m < n. 

5) The states used in the test for that length k cycle are assigned to some partition 
{Po, Pi, ..., Pj} of states of g(x n _i ... x 2 xi) provided these conditions are satisfied: 
i) if all of the states do not exist in any partition, create a new partition. 

ii) if some of the states exist in one partition and the other states do not exist in any 
partition, add the unassigned states to the existing partition provided the 
maximum partition size s is not exceeded (otherwise discard the length k cycle). 

iii) if some of the states exist in one partition and the other states exist in another 
partition, merge the two partitions provided the maximum partition size s 
is not exceeded (otherwise discard the length k cycle). 

6) Return to step 3 until short cycles with increasing lengths cannot be included due to 
the constraints on partitions (e.g., size, disjoint). 

7) For each partition {Po, Pi, ..., Pj}, build all 2 P binary patterns on the p = IPjl states in 
partition Pj. Test each pattern for the short cycles which defined that partition. 
Patterns with short cycles are eliminated. Count the remaining patterns by weight. 
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8) The overall result, the count of the remaining patterns by total weight w, is obtained 
by combining the partial results from two partitions at a time until the results of all 
partitions {Po, Pi, ..., Pj} are merged. Suppose two partitions P t and P u contain t and u 
states, respectively. The count of surviving patterns of weight r with < r < w is 
obtained by combining two partitions at a time using the Vandermonde Identity [7] 



t +u 



s 

V r J ;=oV' 



C 
u 

\r~ i 



9) Determine the performance metric for the results. For an individual weight class w, 
compare the final results with C(2 n_1 -2,w-2). For a weight class distribution for order 
n, sum the final results for all weight classes, divide this sum by the number of order 
n de Bruijn sequences, and compare the quotient with 2 n " 3 . 

Note that step 3 gives preference to the shortest available subcycle since intuitively a shorter 
subcycle eliminates more invalid truth tables than a longer subcycle. Furthermore, step 7 
eliminates truth tables which produces short cycles and counts the survivors. Invariably, if there 
is more than one partition and step 7 does not use all cycles up to length 273, then the results are 
an upper bound rather than the exact number of de Bruijn sequences. Also, if a particular 
partition was determined by a small number of k cycles, then the number of surviving binary 
patterns on the states could be counted analytically using inclusion / exclusion [7] rather than by 
exhaustive testing. 

9. Order 7 Cyclic Subgraph Bound 

Order 7 is the smallest order for which the complete weight class distribution is unknown. 
Order 7 also has some partial results available for comparative purposes so this order illustrates 
the Cyclic Subgraph Methodology application. 

The computations were performed using a Sun Ultra 60 workstation which contains a 
Superscalar Ultra SPARC-II version 9 CPU operating at 300 MHz, 2 Megabyte cache, 4 
Gigabyte of RAM, and Solaris 6.2 operating system. The numerical algorithms were written in 
the C programming language. 

Given these computer resources, the limit on partition size was 36 states. This was borne 
out in the CPU time profile discussed later. The initial partitions were determined by the length 
2, 3, 4, and 5 cycles. All possible combinations of merging these partitions were considered 
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based on including the maximum number of length 6 and 7 cycles while not exceeding 36 states 
total. This resulted in two partitions labelled River and Ocean (for convenience) as shown in 
Table IV. Note that the length 6 cycles (00101 1), (001 101), and (00001 1) are not included 
because they each have states in both partitions. 



Table IV 
Summary of Order 7 Partitions 



Name 


States in g(x6XsX4X3X2Xi) 


Short Cycles k < 7 in Partition 


River 

(26 states) 


1, 2, 3, 4, 5, 8, 9, 10, 
11,16,17,18,19,20,21,32, 
33, 34, 36, 37, 40, 41, 42, 48, 
50, 52, 


01,001,0001, 

00101, 00001, 

000101, 000001, 

0000101, 0000001, 0001001, 0010101 


Ocean 

(36 states) 


6, 7, 12, 13, 14, 15, 22, 23, 
24,25,26,27,28,29,30,31, 
35, 38, 39, 43, 44, 45, 46, 47, 
49,51,53,54,55,56,57,58, 
59, 60, 61, 62 


011,0011,0111, 

00011,00111,01011,01111, 

000111,001111,010111,011111, 

0001111,0101111,0011011,0011111, 

0110111 



Cycles up to length 25 were included as possible in the appropriate partition. Table V 
summarizes the number of short cycle tests included in each partition, where P(n,k) defines the 
maximum number of cycles which could have been included at any length k. 

All 2 26 and 2 3 binary patterns on the states in the River and Ocean partitions, respectively, 
were constructed, examined for short cycles, and tallied by weight if they survived. The Ocean 
partition was tested up to length 10, 13, 16, and 25 cycles in order to investigate the point of 
diminishing returns for including longer cycle lengths. The CPU hours required and the percent 
of remaining Ocean patterns are shown in Table VI. The Ocean program execution time in CPU 
hours exhibited a profile given by 

' 36 l(3-10- 9 )3< k - 10 ) /3 



where r is the pattern weight and k is the maximum length subcycle tested. 
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Table V 
Short Cycle Test Summary for Cyclic Subgraph Upper 
Bound of Order 7 de Bruijn Weight Class Distribution 



Short 
Cycle 
Length k 


P(7,k) 


Total 

Cycles 

Tested 


River 

Cycles 

Tested 


Ocean 
Cycles 
Tested 


2 


1 


1 


1 





3 


2 


2 


1 


1 


4 


3 


3 


1 


2 


5 


6 


6 


2 


4 


6 


9 


6 


2 


4 


7 


18 


9 


4 


5 


8 


30 


12 


4 


8 


9 


50 


17 


5 


12 


10 


85 


22 


5 


17 


11 


154 


30 


7 


23 


12 


271 


44 


9 


35 


13 


482 


65 


13 


52 


14 


877 


97 


18 


79 


15 


1,502 


116 


14 


102 


16 


2,638 


152 


21 


131 


17 


4,618 


209 


13 


196 


18 


8,105 


292 


16 


276 


19 


14,262 


396 


12 


384 


20 


24,931 


499 


14 


485 


21 


43,912 


668 


26 


642 


22 


76.236 


861 


20 


841 


23 


132,632 


1,164 


36 


1,128 


24 


229,990 


1,446 


16 


1,430 


25 


397,260 


1,700 





1,700 



Table VI 
Ocean Partition Profile 



Maximum 
Length 
Cycle k 


CPU Hours 


Percent 

Patterns 

Remaining 


10 


206 


55.734032 


13 


620 


53.047629 


16 


1,860 


51.866302 


25 


50,145 


50.959738 
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The River and Ocean results were combined for each weight class w using 

vv-2 

Y,26 River i 360cean w _ 2 _i 
(=0 

(dual of Vandermonde Identity) to yield an upper bound for that weight class as shown in Table 
VII. Summing all weight class bounds and dividing by the 2 sequences yields an overall ratio 
of 4.077159482. The Cyclic Subgraph Methodology metric is a 74.52 percent reduction of the 
15.999292992 metric from the original combinatorial based upper bound on weight class sizes. 



Table VII 

Upper Bounds on Weight Class Distribution 

of Order 7 de Bruijn Sequences 



Weight 
Class 


Naive Upper 
Bound C(62,w-2) 


Cyclic Subgraph 
Upper Bound D(7,w) 


19 


739,632,519,584,070 


98,666,900,393,138 


21 


4,282,083,008,118,300 


774,674,197,918,760 


23 


18,412,956,934,908,690 


4,017,542,518,481,138 


25 


59,678,358,445,158,600 


14,542,557,734,155,318 


27 


147,405,545,359,541,742 


38,106,646,712,313,044 


29 


279,692,573,246,309,972 


74,084,027,212,710,224 


31 


409,894,288,378,212,890 


108,641,336,612,781,434 


33 


465,428,353,255,261,088 


121,460,568,740,676,744 


35 


409,894,288,378,212,890 


104,147,518,340,124,508 


37 


279,692,573,246,309,972 


68,634,482,648,537,036 


39 


147,405,545,359,541,742 


34,714,653,123,132,692 


41 


59,678,358,445,158,600 


13,413,012,490,444,814 


43 


18,412,956,934,908,690 


3,927,199,071,108,794 


45 


4,282,083,008,118,300 


861,077,229,690,120 


47 


739,632,519,584,070 


139,077,193,488,948 


49 


93,052,749,919,920 


16,177,749,516,244 


51 


8,308,281,242,850 


1,313,493,786,292 


53 


508,271,323,092 


71,175,705,612 


55 


20,286,591,270 


2,406,502,980 


Total 


2,305,741,118,628,006,748 


587,580,605,551,467,840 


Ratio 


15.999292992 


4.077159482 
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10. Order 7 Weight Class Distribution 

A combinatorial based approximation to an order n de Bruijn weight class distribution is 
given by C(2 n_1 -2,w-2) / 2 n " 3 for each odd weight w, Z(n)-1 < w < 2 n " 1 -Z*(n)+l. This is each 
weight class binomial coefficient size divided by the overall ratio of the sum of the weight class 
sizes to the number of de Bruijn sequences. 

Using the Cyclic Subgraph Methodology, the underlying order n weight class distribution is 
estimated by combining the calculated upper bound with the symmetry groups in an order n and 
the ratio of remaining truth tables to de Bruijn sequences. 

Let D(n,w) be the upper bound of the weight class w which results from the Cyclic Subgraph 
Methodology. Let R(n) be the overall ratio of the sum of the weight class sizes to the number of 
order n de Bruijn sequences. Let E(n,w) be the estimate of the number of order n de Bruijn 
sequences in each weight class w. Then R(n) and E(n,w) are 

2 n -'-Z*(")+l / [ 

R(n)= ^D(n,w) I 2 " ~ n E(n,w) = D(n,w)/R(n) . 

Z(n)-1 / 

The de Bruijn weight classes within an order n contain large symmetry groups [6]. Thus the 
estimate for each weight class is conveniently and more concisely expressed in terms of the 
coefficient in each weight class. 

Order 7 is the smallest order for which the complete weight class distribution is unknown. 
Order 7 also has some partial results available [2, 3] for comparative purposes so this order will 
be used to illustrate and apply this weight class estimating methodology. In order 7, each weight 
class has a symmetry group of 2 26 sequences [2, 6]. The coefficient estimates for the order 7 de 
Bruijn sequences per weight classes are shown in Table VIII. The first estimate is given by 
C(62,w-2) / (15.999292992 • 2 26 ). The second estimate is D(7,w) / (4.077159482 • 2 26 ), where 
D(7,w) is the appropriate entry from Table VII. Actual coefficients are listed where known. 

In all order 7 weight classes for which the actual coefficient is known, the Cyclic Subgraph 
Methodology estimate is much closer than the combinatorial based value. Presumably, the 
Cyclic Subgraph Methodology estimate at weight class 19 would have been even closer if more 
length 7 cycle tests could have been included. The Cyclic Subgraph Methodology estimates also 
suggest that the middle weight classes 2 n " 2 -l and 2 n " 2 +l contain more order 7 de Bruijn 
sequences than suggested by the combinatorial bound and 2 n ~ 3 ratio. 
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Table VIII 

Coefficient Estimates for Weight Class 

Distribution of Order 7 de Bruijn Sequences 



Weight 
Class 


Coefficient from 
C(62,w-2) Bound 


Coefficient from 
D(7,w) Bound 


Actual 
Coefficient [2, 3] 


19 


688,867 


360,607 


91,125 


21 


3,988,177 


2,831,271 




23 


17,149,160 


14,683,274 




25 


55,582,259 


53,149,993 




27 


137,288,178 


139,271,787 




29 


260,495,518 


270,761,553 




31 


381,760,673 


397,061,257 




33 


433,483,087 


443,912,857 




35 


381,760,673 


380,637,296 




37 


260,495,518 


250,844,612 




39 


137,288,178 


126,874,763 




41 


55,582,259 


49,021,742 




43 


17,149,160 


14,353,087 




45 


3,988,177 


3,147,056 




47 


688,867 


508,298 




49 


86,666 


59,126 


52,240 


51 


7,738 


4,800 


3,773 


53 


474 


260 


160 


55 


19 


9 


3 


Total 


2,147,483,648 


2,147,483,648 





In the two order 7 weight classes for which intermediate projections are available, the Cyclic 
Subgraph Methodology estimate also is closer that the combinatorial derived estimate. A 
computer program written in C is currently being used to attempt an "exhaustive search" of order 
7 de Bruijn weight classes 21 and 47. Weight classes 21 and 47 have 4,282 trillion and 739 
trillion truth table candidate cases, respectively, for an unknown number of sequences. From 
these searches, the weight class 21 and 47 estimates are 1,259,021 * 2 2 sequences and 
526,578 * 2 sequences, respectively. As seen in Table VIII, both of these estimates are more 
consistent with the estimates from the Cyclic Subgraph Methodology than the estimates from the 
combinatorial upper bound method. Finally, empirical evidence again indicates that the valid 
order 7 weight classes are nonempty and the overall order 7 weight class distribution has a 
Gaussian shape (albeit truncated on the low weight side). 
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11. Conclusions 

At present, the weight class distributions of the de Bruijn sequence generator functions are 
theoretically unresolved and computationally intractable for all but the smallest orders n. As a 
benchmark, a computer program was constructed for an exhaustive search of order 7 de Bruijn 
weight classes 21 and 47. With the same workstation resources, this program examines 1 billion 
cases per CPU hour. At this rate, completely examining the order 7 weight classes would require 
2.3 billion CPU hours. 

As an alternative to exact weight class distributions which are currently unattainable, 
graphical, numerical, and combinatorial techniques were combined to develop a method for 
estimating an unknown de Bruijn weight class distribution. With just 50 thousand CPU hours, 
the Cyclic Subgraph Methodology produced a higher fidelity estimate than the combinatorial 
derived approach as measured by the known partial order 7 de Bruijn weight class distribution. 
Perhaps access to a specialized processor such as a hypercube might lead to more data points on 
the order 7 weight class distribution and provide an opportunity to further assess the accuracy of 
the estimates based on the Cyclic Subgraph Methodology presented herein. 
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Abstract. We consider a positive linear operator P m ,n defined on the space 
of functions of two variables having exponential growth at infinity. We study 
the properties of approximation of a function / by means of its image P m ,nf 
and we estimate the order of approximation. 



1 Introduction 

In a recent paper [2], using a method given by A. Jakimovski and D. Leviatan [3], we 

introduced a generalized Szasz type operator. Let us remind it and some of its properties. 

We denote the weight function by w p (x) = e~ px , x > 0, where p > and we consider 

C p = {/ G C[0, oo) : w p f is uniform continuous and bounded on [0, oo)}, with the norm 

p = sup w p (x)\f(x)\. 

ire[0,oo) 



We define the positive linear operators 

P ™(^ x "> = ~TTT — y2P2k(nx)f I — ) 
cnlcnnx ^-^ \ n I 



fc=0 
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where / G C p , chx is the hyperbolic cosine of x and p 2 k are the polynomials defined by 
the relation 

oo 

(1) ch.uch.ux = / j P2k(x)u 2k , 

k=0 

that is 

, N (l+x) 2k + (l-x) 2k 

P2k{X) = 



2(2k)\ 

We will use some properties of operator P n . 
Lemma A. If x E [0, oo) and n GN, we have 

P n (e ;x) = 1 



and 



1 
Pn\ e u x ) — xthnx H — thl 
n 

P n (e 2 ;x) =x 2 + (l + 2thl)-thnx+ — (1 + thl) 

n n 2 



P n ({t-x) 2 ;x)<-{x + l), 
n 



where ei{x) = x % , i G {0, 1,2} and thx is the hyperbolic tangent of x. 

Lemma B. Let p > 0, r > p and n be a natural number such that Uq > 
For all x G [0, oo) and n > no we have 



p 



In r — In p 



w r (x)PJe pt ;x) < 2 



P t.^/o ch (e p/n ) 



chl 

and 



w r (x)P n ((t - x) 2 e pt ; x) < A p 



ch (e p / n ) x + 1 



chl n 

where A pr is a positive constant depending only on p and r. 

Lemma C. If f G C p and r,p and n Q are numbers like in Lemma B, then 

\\P n f\\r<2\\f\\ p C ^. 
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2 Definition of operators in two variables 

Now we consider an extension of operator P n to the case of two variables. 
We denote by C(R+) the space of real valued functions of two variables, continuous 
on IR^ = {(x,y) : x > 0, y > 0}. We will use the weight function w p (x) = e~ px , x > 0, 
p > and we will consider for p, q > and (x, y) G M+ the weight 

w p ,q(x,y) = w p (x)w q (y). 

Let C P) g — {/ E (7(R+) : tu p , ? / is uniform continuous and bounded on R+}, with the 



norm 



(2) ll/llp,«= SU P w m {x,y)\f{x,y)\. 

(x,y)eR% 

For each / G C VA we consider the following positive linear operator 

1 °° °° fey ■ OjL\ 

(3) *».«(/; *>*) = ^^EE^H^N/ (A -J 



where m, n G N, (x, y) G R+. 

Remark 1. If p 2 k are defined by the relation 



chux = 2_]p2k(x)u 2 



2 k 
P2k\X)V 

k=0 

we obtain 



P2k{X) 






(2k)V 
and we get to the operators considered by M. Lesniewicz and L. Rempulska [4] 

L mn (f;x,y) - l ffH! M!!; (V*J) . 

chmxchny -^ -^ (2j)! (2/c)! \m' n/ 

Remark 2. If f(x,y) = fi(x)f 2 (y) with / x G C p , / 2 G C g , p > 0, q > 0, then for all 
(x, y) G M+ and m, n G N, the following holds 

(4) P m ,n(/; x, y) = P ro (A; x)P n (/ 2 ; y). 
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3 Some properties of P m ?n operators 

In order to study the approximation properties of P m ,n operators we need some 
auxiliary results. First we study the effect of the operators on the test functions. 
Lemma 1. If (x, y) G IR+ and m, n G N the following holds: 

P m ,n{ e o,o', x ,y) = 1 

P m ,n( e ifli X > V) = xthmx H thl 

m 

P m ,n( e o,u x, y) = ythny + -thl 

n 

P m ,n(e2,2] x, y) = x 2 + y 2 + (1 + 2thl) f-thmx + -thny) + ( — + — ) (1 + thl) 

\m n / \m z n z J 

where e 0fi (t, r) = I, e lfi (t : r) = t, e ,i(t, t) = t and e 2 ^(t, r) = t 2 + r 2 . 

Proof. The easiest way to prove these relations is to use remark 2 and Lemma A. 
We can also obtain these results by calculus. For instance 

P m ,n( e ho;x,y) = 2 — y2p2k(ny)y2p 2j (mx) — . 

ch Ichmxchny r-f ~^f m 

From (1) we have: 

OO 

^2p2k(ny) = chlchny. 

fc=0 

By using the derivative of (1) with respect to the variable u and replacing u by 1 and 

x by mx it results 

2 3 1 



EP2j(mx) — = — shlchmx + ajchlshmx. 
rn rn 

Hence 



m m 



Pmn( e i oi x -, y) — xthmx -\ thl. 

m 

Lemma 2. Let p,q > 0, r > p, s > q be fixed numbers. We consider m ,n natural 

numbers such that 

p q 

ttiq > — and n > 



In r — In p In s — In q 
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Then for all m > mo and n > no the following holds 

ch(e p )ch(e' ? ) 



W* m,n || r,s _ ^||/||p,g 



ch 2 l 



Proof. For all (x,y) G IR+, we have 



w rtS (x,y)\P miTl (x,y)\ < 



< w r!S (x, y) — 2 

ch lch.mxch.ny 

According to relation (2), we have 

'2j 2k 



^ P2j(mx)p 2 k(ny) 



f 



j,k=0 



< 



f 



2j 2k 



m n 



m n 



p.q 



P M 5- 
(^ m g y n 



hence 



w r>s (x,y)\P min (x,y)\ <w rtS (x,y) 



ch lchmxchny 



21 2k 



P ,q ^ P2j(mx)p 2 k(ny)e p ^e' - - 
j,k=o 

w r (x)w s (y)\\f\\ M P m (eP t ;x)P n (e^;y) < 



< 



IIP e p ll IIP e 9 ll 

PjQII- 1 m llrll- 1 n° ||s- 



Now we can use Lemma C and we get to the desired inequality. 
Theorem 1. Let p, q > and r > p, s > q be fixed numbers. 
If f G Cp and m ,no are natural numbers such that 



m > 



P 



and no > 



q 



lnr — Inp Ins — In q 

then there is a positive constant K VA ^ r ^ s depending on p, q, r, s such that 



w rt8 (x,y)\P m ,n(f;x,y) - f(x,y)\ < 
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<K p , q ,r, S [\\f%, q \-— + \\f' 



X + l 



m 



iy\\p,q 



y + 1 

n 



Proof. Let {x, y) G R+ be a fixed point. For / E C* _ and all (£, r) G R+ we have 



f(t,T)-f(x,y) = I fL( u , r ) du + fv( x ,v)dv 



and, for m, n G N, 



^m,n(/; a:, y) - f(x, v) = P m ,n (jfu 



(u, r)du; x, y I + 



fv(x,v)dv;x,y) . 



i n 



We multiply this relation by the weight function and we have 



w r ,a(x,y)\P mtn (f',x,y)-f(x,y)\<w rta P mtn [ / f' u {u,r)du;x,y ) + 



+w r ,s(x,y)P m ,n( f v {x,v)dv\x,y 1 . 



According to relation (2) we can write 



f 'f u (u,T)du <\\f x \\ m f e^du< 

J X J X 

<\\f'x\\ P ^ qT (e pt + en\t-x\. 



In the same way we obtain 

f v (x,v)dv 
Using this inequalities it results: 



<WfX, q e pt (e^ + e^)\r-y\. 



w r , s (x,y)\P mtn (f;x,y) - f(x,y)\ < 
< w r Jf' x \\ M P m ,n(e qT (e pt + e px )\t - x\;x,y)+ 

i"Jr,s || Jy || p,q-Lm,n \" \p \& )\T i/|)-£,t/J /i mra ~r -D mr 
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Next we will use remark 2 and we can write 

A m , n = e- rx e- s y\\f' x \\ p , q Pn(e qT ;y){P m (e pt \t - x\;x) + e px P m {\t - x\;x)} 

and 

B m , n = e- rx e- sy \\f y \\ p , q P m {e^x){P n {e^\r - y\;y) + e™P n (\r - y\;y)}. 

We will focus on A m<n and we try to estimate it. 

We can apply Lemma B and Cauchy's inequality and we obtain: 

ch (e q/n ) 
w s (y)Pn(e qT ;y)<2 ^ ; 

and 

w r {x)P m {\t - x\e pt ; x) < ^/^xjpj^xj ■ y/w r (x)P m ((t - x)V;x). 

Also 



w r (x)e px P m (\t-x\;x) < y/P m ((t - x) 2 ;x) ■ e^^ 
By making use of these inequalities and Lemma B we obtain 



A <\\f'\\ Q Ch W K/ 2ch(e^) J~ M^) x + l 



^ r)x \f^^\ = WfX,K P ^ 



lx+ 1 

_l_gv _ •■ ........ 

/;; 
In the same way we obtain 



£>m,n S ||/j;||p,g-^(? 



n 



By adding A mi „ and £> m ,„ we get the desired result. 

4 Estimate of the order of approximation 

Now we will estimate the order of approximation of function / G C PA by operators 
p 
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For u > 0, v > and / G C Pi9 we consider 

A„,„/(a;, y) = f(x + u,y + v)- f(x, y) 
and the modulus of continuity 
(5) w(f,C p , q ;h,6)= sup ||A„^/(-, .)|| P)g . 

0<u<h 
()<v<S 

Theorem 2. Let p > 0, q > 0, f G C p ^ q and let r > p, s > q be fixed numbers. If 
mo, no are fixed numbers such that 

p q 

tuq > — and n > 



lnr — hip Ins — In q 

then there is a positive constant K depending only on p, q, r, s such that 



Wr, a (x,y)\P m>n (f;x,y) - f(x,y)\ < Ku I f,C m ;\ ,\ 

\ V m V n 

holds for all (x, y) G M+ and m > m , n > n . 

Proof. We consider the Steklov mean of / G C Piq 

fh,s=Tx / f(x + u,y + v)dudv 

where h, 5 > and (x,y) G R+. 
We have 

fhA x ,v) ~ f(x,y) = — / [f(x + u,y + v) - f(x,y)]dudv 

h r S 



— I I A uv f(x,y)dudv. 

ho Jo Jo 



Because 

dx v '" hS Jo 
and 



' 5 ' x > y) = TS I [f( x + h,y + v)- f(x, y + v)]dv 



-^( x ' y) = j^ if( x + u,y + $)- f(x + u > y)) du 
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it results that fh,s £ Cp q . 

By making use of relations (2) and (5) we can write successively: 



(6) 



\\h,6 - f\\ P ,q = sup w p , q (x,y)\f hiS (x,y)-f(x,y)\ 

(x,y)€R 2 + 



sup w Pjq (x,y) 



(x,y)m 2 + 



hS 



h r S 



o Jo 



A U)V f(x,y)dudv 



< 



h f S 



< ~ux I I sup w p,q( x iy)\Au,vf(x,y)\dudv 
110 Jo Jo (x,y)m\ 



1 

hS 



h r S 



JO 



\A u ,vf\\p,qdudv < 



*h r& 



< 



(7) 



— / / sup \\A UtV f(-,-)\\ Ptq dudv = u;(f,C Ptq ;h,6). 

110 J Jq 0<u<h 
0<v<6 

Next we can proceed in the same way: 

dfh,s , v df, 



dx 



= sup w P: q(x,y) 

p, q (x,y)£R 2 + 



h,6 



dx 



x,y) 



sup^ w Ptg (x,y) — 



(x,y)e 



[f(x + h,y + v)- f(x, y + v)]dv 



sup w Pt q(x,y) 



(x,y)m 2 + 



h5 



[f(x + h,y + v)- f(x, y) + f(x, y) - f(x, y + v)]dv 



< 



(x,j/)ei 



< sup w P:g (x,y)— I (\A hiV f(x,y)\ + \A 0:V f(x,y)\)dv 

j2 ltd 



hS J 



\Ah,vf(;-)\\p,q+\\&o,vf(;-)\\p, q )dv< 



<: ~U I SU P \\^u,vf(-,-)\\ P ,qdv = -uj(f,C p>q ;h,5). 

llO ./Q 0<u<h ll 



0<v<S 

In a similar manner one obtains: 



dfi 



li.S 



dy 



/'■'/ 



< fiU(f,Cp, q ;h,6). 



For every fixed (x, y) G M+, r > p, s > q, h, 5 > we have 



w r , s \P m>n (f;x,y) - f(x,y)\ < w r>s {\P m , n (f - f hi s',x,y)\+ 
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+\ p mAfh,6',x,y) - fh,s{x,y)\ + \h,s(x,y) - f(x,y)\}. 
Using Lemma 2 and (6) for all m > itlq and n > no, it results: 

ch(e p )ch(e' ? ) 



Wr,s\Pm,n(f ~ fh,5]X,y)\ < 4||/ - fh,s\\p, 

< 4w(/, C p , g ; /i, <5) 



9 ch 2 l 
ch(e p )ch(e 9 ) 



< 



ch 2 l 

Now we apply Theorem 1 and relations (7), (8) and so, for all m > mo and n > no, 
one obtains: 

Wr,s\Pm,n(fh,S;X,y) - fh,s( x >V)\ ^ 

df h ,6 



<K 



p,q,r,s 



dx 



p. 1 1 



x + 1 



m 



dfi 



h,5 



dy 



/<■'/ 



y + 1 



< 



n 



<2K ma u(f,C p , q ;h,6) ( T^Y^ + ^V^ 
Therefore, for all m > mo, n > no, h, S > it results that: 



w r ,s{x,y)\P m , n {f;x,y) - f(x,y)\ < 



+2K 



p,q,r,s 



1 /x + 1 1 /y + 1 



h\m 5 V ft- 



' ch(e p )ch(e 9 ) 
! ch 2 ! + 



w(/, C Pig ; /i, <5) 



Now we choose h 



x + 1 



S 



y + 1 



and we get to the desired result. 



m \ n 

From this theorem we can extract a result which we also obtained from Lemma 1. 



Corollary. If f G C PA , withp,q > 0, then 



lim P mn (f;x,y) = f(x,y) 



m,n^oo 



for all (x,y) G Ri, the convergence being uniform on every rectangle [0, a] x [0,6], a > 0, 
b>0. 
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On the solution structures of a class of 
semilinear elliptic equations on R d 

Nedra Belhaj Rhouma 
Institut Preparatoire aux Etudes d'Ingenieurs de Tunis. Tunisia 



1 INTRODUCTION 

In this paper, we consider the semilinear elliptic equation of the form 

Lu - kifi(u)u - k 2 f 2 (u)u = in M d , d > 3, (1) 

where the operator L is in the following form: 

d Q2 d Q 

k\ and /c 2 are given Green-tight functions defined on M d , f\ and f 2 are 
nonnegative continuous functions. 

We shall prove the existence of two types of solutions of (1): 

1. Positive solutions u c of (1) satisfying 

u c (x) — ^ c as \x\ — > oo for some c > 0. (2) 

2. Maximal solution ut where 

ut = sup {u(x) : u is a positive solution of (1) such that u > T} . 

Then we give the asymptotic behaviour of the maximal solution. We show 
that if k 2 — \x\~ u near oo for some v > 2, , then 
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U mo (x) = 0(\x\«- 



d 

when J2 \bi\ ^ and 
i=i 



U mo {x) = 0{\x\^) 



d 



when Yj \b%\ = . Moreover, if L = A and f2(y)=y q 1 , ? > 1, then 
i=i 



C/ mo (x)^(|x|^T) 



Equation (1) arises from physics and geometry. We give a description of 
the geometric application of this work (For more details see [5], [6] and [10]). 

Let g = (gij)ij=i,...,d, be a Riemannian metric on M d and k\(x) be its 
scalar curvature. Let /^(x) be a given function on M d . The following 
question has been raised: Can we find a knew metric g\ on M d such that 
k±(x) is the scalar curvature of g\ and g\ is conformal to g ( that is g\ = $g) 

4 

for some positive function $. When d > 3, if $ = u d - 2 ,u > 0, then, this is 
equivalent to the problem of solving the elliptic equation 

-\ -A fl w - fci« - A; 2 m^ = in R d 

a — 2 

where 



1 9 /— ^ , .9 



gr| jj ^1 OXj 

is the Laplace Beltrami operator, \g\ = det(gij) , g l 'i = (g^) -1 . 

Equation (1) has been around for a long time. Ni in [10] proved that the 
equation 

Au + K(x)ui-z =0 

has infinetely many bounded positive solutions in M d , when the decay of 
|i^(a;)| is faster than C|x| _ near oo for some positive constant C > 0. 
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Under the same assumptions, this result was improved by Naito in [9] and 
Li and Ni in [7]. Kenig and Ni in [6] considered the problem 

Lu - k(x)u + K(x)u q = (3) 

d 

in M d , where q > l,Lu = J2 (di,j(x)u Xi ) x - and (a,i,j(x)) is symmetric, 

positive definite matrix with smooth coefficients of which the eigenvalues 
are of the order of magnitude \x\ at oo with — oo < a < 1. they 

showed that if there exists e > such that |-K"(a;)| < C\ \x\ ~ >-~ £ anc [ 
< k(x) < C 2 |x| ~~ >~~ s for \x\ large and for some positive constants 
Ciand C 2 , then (3) possesses infinitely many bounded positive solutions with 
the property that each of these solution is bounded below by a positive con- 
stant. 

Later on, Lin in [8] consider a matrix (oij(x)) such that ay = a J: j for 
i,j = 1, ..., d, are measurable and satisfy the uniform ellipticity condition and 
a function w : M + — > M + locally bounded and such that 



r w(r)dr = A < oo. 



They showed that there exists a constant 6 = 0(d,A) such that if |-K"(x)| < 
n+llh2 an d 1^(^)1 < (l+iSri) 2 ^ or some positive constant c > 0, then for 
every c G (|, |), (3) possesses a positive solution u c satisfying u c (x) — > c as 
|x| — >• oo. 

Chern in [3], consider the problem 

Am + K^vP + K 2 (x)u q = in R d (4) 

when d > 3, q > p > 1, Ki and i^2 are two given holder continuous functions 
on M d such that K\ > 0, i^2 < and there exist a Holder continuous function 
K = K(\x\) and a positive constant T such that 

i. X(|x|) < K 2 (x), 

oo 

ii. / sK(s)ds > — oo, 

o 

iii. KAx) < -T q -P.K 2 (x). 
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He showed particularly that there exists T > such that for all c > 
T , the problem (4) has a bounded positive solutions u c with the property 
u c (x) ->cas \x\ — ► oo. Also he gave some conditions to get the existence of 
maximal solutions of (4). 

2 Preliminary 

Let B(M d ) (C(M d ) resp.) denote the set of all measurable (continuous resp.) 
functions on M d . Given any set A of functions, let A + (Ab resp.) be the set 
of positive (bounded resp.) functions in A. 

Let L be a partial differential operaror in the following form: 

d Q2 d Q 

L= £ a i3 {x)— — - + Y^h{x)— + c(x) 

such that 

i. c < 0, 

ii. a,ij = dji for i,j = l...n, are positive measurable and satisfy the uniform 

d 
ellipticity condition: there exists A > 1 such that E a ij( x )£,i£,j > 

A-^^foraJlx^e^. 

iii. There exists a E ]0, 1[ such that 

d d 

J2 Wij(x) - Oij(y)\ + J2 \ b i( x ) ~ Hy)\ + |c(x) - c(y)\ < A \x - y\ a , 
iv. E I o^ 0*0 1 + E 1^(^)1 + \c(x)\ < A. 

i,j=l i=l 

We will denote by G L (G resp.) the Green function of L (A resp.) on M d . 
It is known that there exists a constant C = C(A, a, d) such that 

G.L ^ /"f II II 2 — a 

< C \\x — y\\ 
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Let / be a positive measurable function on all M d such that 

G L (x,y)f(y)(dy) 



x 



is a continuous bounded potential. 

We denote by K* the mapping defined on Bb(M d ) as follows: Vg G Bf,(lR d ) 



Kf{g){x) = K{fg){x) 



M d 



G L g{y)f{y){dy). 



Note that for every g G B^(M d ), K* (g) is a potential on M d . 
In the sequel, for an open subset V of M d , we will denote by S(V) the 
set of superharmonic functions on V. 

Proposition 2.1 The bounded operator I + K* on the banach space Bb(R d ) 
is invertible and 

< (I + K f Y l s < s for every s G S + (R d ) 

Proof. Similar to the proof of Proposition 2.5 in [2]. ■ 

Definition 2.1 A borel function f defined on M d is called Green-tight if and 
only if f satisfies the two conditions: 



lim 

m(A)-*0 

AcR d 



, f(y)(dy) 

SU P / . n ^2 

\_xen d A \\ x ~ y\\ 







(5) 



and 



lim 



, f(y)(dy) 

SU P / n ^=2 

, xeR dJy>M \\x -y\\ 



0. 



(6) 



In the next, we recall the Kato class defined in [1]. 



Definition 2.2 A borel function f is said to be in the Kato cass K&, if and 
only if 

f(y)(dy) 



lim 



SUP /,, „ II lld-2 

rxem dJ\\x-y\\<e \\ X -y\ 







and f G K l £ c , if and only if for every ball B in M d , 1b f G Kj. 
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In [11], Zhao introduced the class K™ as follows 






/ G K?: lim 

M^oo 



sup 



f(y)(dy) 

id- 2 



y>M |x — y 







Proposition 2.2 ^4 borel function f defined on M d is Green-tight if and only 

iffeKf. 

Proof, see [12]. ■ 

Proposition 2.3 Let f be a borel function in M d . Suppose that f is in the 
Kato cass K l f c and there exits a number L > and a positive function ip 
defined on [L, oof such that 



and 



V\x\>L,\f(y)\<tp(\y\) 



np{r)dr < oo. 



(7) 



Then f is a Green- tight function in M d . 

Proof. Since (5) is obvious, we shall prove that / satisfies the limit 
equality in (6). 

For each x G M d , we take a coordinate system (r, $i, ...,9 n ) such that 
x = (\x\ , 0, ..., 0). Then, we have by ( 7) that VM > L 

\f(v)\ ... / f <p(\v\) 



y>M \\x — y 



id- 2 



dy < I 

Jv 



y>M \\x — y 

< CTd-2 / V?( r ) 

JM 



°d-2 I r(f(r) 

IM 



id- 2 



dy 



{sin9 1 Y- I d9 1 



o (r 2 + \x\ — 2r|x|cos^i) 2 

(sin#i) fl! - 2 ^i 
n (l + (M)2_2(M )cos ^ l) ^ 



r d_1 dr 



dr, 



where o~d-2 is the area of Sd_2(l). On the other hand, we have the estimate, 
Vx>0 



( sin l 



\d-2 



d-2 

2 



dd < T- l ix. 



It follows that 



SUP / ^32 

xeRd Jy>M \\x-y\\ 



(1 + |x| — 2r \x\ cos9)~ 

roo 

dy < o- d _ 2 2^ 2 7r / r<p(r)dr — > as M — ► 00. 
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Remark 2.1 If \f(x)\ < 5j +E for \\x\\ large and for some e,c > 0, then f 

\\ x \\ 
is Green-tight. 

Let C (R d ) ={ue C(R d ) : lim|| a .||_ 0O u(x) = o} . 
Proposition 2.4 Let $ be a Green-tight function in M d and 
Fs, = {ipe B(R d ) : \<p(x)\ < |$(x)| , Vx e iR d } . 

Then, the family < f M d _ Jl-^ dy^ G Fq>> is uniformly bounded and equicon- 
tinuous in Co(M d ), and consequently it is relatively compact in Co(M d ). 
Proof. For each <p e F$, we have 

JR d \\x-y\\ JR d \\x-y\\ 

Hence, the family F$ is uniformly bounded and lim^i^oo f Md _ Vj-z dy = 
uniformly for all function tp E F®. 
Finally, let x, x' e iR d , then 

<p(y), f <p(y) 

x — y\\ J JR d \\x' — y 



Jn? \\ x _ v \\ d - 2 y J R d || T /_ 7 ,|| rf - 2 - J M 



I iirf — 2 II , it rf— 2 

\x-y\\ \\x'-y\\ 



$(y)\dy^0 



as x — > x . 

Thus the family F$ is equicontinuous. Since it is uniformly bounded, we 
conclude by Ascoli-Arzela theorem that the family F& is relatively compact 
in C (M d ). m 

Since G and G L are comparable, then we get the following result. 
Corollary 2.1 Let $ be a Green-tight function in M d and 

F^ = {ipe B(R d ) : \<p(x)\ < |$(x)| , Vx e iR d } . 

Then, the family lf M d G L \f{y)\ dy,(p £ F§> is uniformly bounded and equicon- 
tinuous in Co(M d ), and consequently it is relatively compact in Co(iR d ). 

Let /i (/2resp.) denote the map: y — ► fi(y)y (y — *■ f2(y)y resp.) and for 
all y > 0, we denote by fc y , the function x — > fcy(x) = fci(x) + yk 2 (x). 

In all the next, we assume that the functions k\ and fc 2 are Green-tight, 
and that the functions f\ and / 2 are nonnegative and continuous, 
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3 Existence of positive solutions u c for (1) 

In this section, we shall prove the existence of positive solutions u c of (1) 
which satisfies (2). 

We suppose that the following property is satisfied: 

(A) Vc > 0, there exists a c > such that for every x G M d , the function 

y -> -ki(x)fi(y)y - k 2 (x)f 2 (y)y + a c (\k 2 (x)\ + \h(x)\)y 

is increasing on [0, c]. 

Particularly; if / is locally Lipschitz, then (A) is satisfied. 
In the next, we shall prove the following result: 

Theorem 3.1 We suppose that the set 

p = { c > o : fei(x)/i(c) + k 2 (x)f 2 (c) > 0, Vx e R d ] 

is nonempty. 

Then, for all c G P, there exists a positive solution u c of (1) which satisfies 
(2). 

Corollary 3.1 We Suppose that: 

1. k 2 >0 on R d , 

2. There exists T > such that kT (x) > 0,Vx G M d , 

3. lim^oo^g] = +oo. 

Then, there exits m > 0, such that for each c > m , equation (1) has a 
positive solution u c which satisfies (2). 

Proof. There exits m > 0, such that for each y > m , we have j¥\ > T . 
Hence, for each c > m , we have 

fci(x)/i(c) + A;2(x)/2(c) = /i(c)(A; 1 ( a :) + A; 2 ( a :) /2!r) > 



/i(c) 
> f 1 (c)k To (x)>0. 



The conclusion follows by using Theorem 3.1. 
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Remark 3.1 Note that the condition h\ and h 2 are Green-tight is more gen- 
eral than the decay condition \k 2 \ < cr~ l near oo for some I > 2 which 
appeared first in [9], later in [6] and [10] and it is more general than the 
condition 

oo 

/ rK(r)dr < oo 

o 

which appeared in [4] and [3]. 

Moreover, Lin proved in [8] that if h 2 > Q-e , for some e > 0, then (1) 

has no positive solution in R d . He also showed that if L = A, hi = and 
hi = h\{r) = jt^2, equation (1) has no bounded solution in M d . 

In what follows, we shall prove Theorem 3.1. 

Definition 3.1 We say that a function u is a subsolution of (1) if 

u + K klh{u \u) + K k2hi - U \u) e S{R d ). 

We say that a function u is a sursolution of (1) if 

u + K klhi - U \u) + K k2f2{u \u) e -S(R d ). 

Proposition 3.1 Suppose that there exists two bounded functions u < vq 
such that uq is a subsolution of (1) satisfyuing (2) and vq a sursolution of 
(1) satisfyuing (2). Then there exits a solution w of (1) which satisfies (2) 
and such that uq <w <vq. 

Proof. There exit s € S(R d ) and s' e -S(R d ) such that 

v + K klMvo) v + K k2f2< - Va) v = c + s 

and 

u + K klMuo) u + K k2Muo) u = c + s'. 

Since lim|| a; ||_^ 00 i>o(a;) = lim|| i; ||_ i . oo w (a;) = c, it follows that lim||, c ||^ 00 s(:r) = 
lim|j. E ||^ 00 s / (:r) = 0, we get s > and —s' > on R d . Let c = max(||-u || , \\ v o\\) 
and a c satisfying (A 3 ), then for each u G C^(R d ), we set 

F(u) = -hi(x)fi(u)u - h 2 (x)f 2 (u)u + a c (\h 2 (x)\ + \ki(x)\)u 
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and 



'c 1 



Now, we define an operator T on C^(M ) as follows: 

Vm e C+{IR d ),T{u) = (/ + KY\K F ^ + c). 

First, we claim that the operator T is increasing. In fact, if < u < v, 
then by Proposition 2.1, 2» - T{u) = (I + ^-i^KM > . On 
the other hand, we have T(v ) < v . In fact, since (/ + K)(v — T(v )) = 
v + K klfl ^v + K k * f ^v -c = s E S + (R d ), hence, by Proposition 2.1, 
v -T(v ) = (I + K)- 1 s>0. 

Similarly, we show that T(w ) > u . Thus, the sequence defined induc- 
tively by u n = T(w„_i) and v n = T(v n -i) are monotonous and u n < v n . Let 
w = linijj^ooiv Since by Proposition 2.4 the operator T is compact and con- 
tinuous, we get T(w) = w and consequently w + K kl ^^w + K k2 ^ w ^w = c. 



In what follows, we set E = lu E Co(]R d ) : < u < c\ and we define the 
operator F as follows: 

Vw e E, F{u) = (/ + i^/iW+fc 2 + /2(«))-i c . 
Lemma 3.1 F(E) is relatively compact in Co(M d ). 
Proof. Let {u n ) n be a sequence in E. Then 

F{u n ) + K(ktfi(u n ) + k£f 2 (u n )(F(u n )) = c. 
By Proposition 2.1, we have 

< F(u n ) < c. 
Thus, by Proposition 2.4, the family 

{K(ktfi(u n ) + k+f 2 (u n ))(F(u n ))) :neN} 
is relatively compact in C (M d ) m 
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Lemma 3.2 The operator F is continuous. 

Proof. Let (u n ) n be a sequence in E such that (u n ) n converges uniformly 
to a function u G Co(M d ). Let us suppose that (F(u n )) n converges to a 
function v. Then v + K k ^ M a ) +k 2 f'^ u ){v) = c and therefore v — F(u). ■ 

Lemma 3.3 Let c > 0. Then there exists v c G E such that v c is a subsolution 
of (1) satisfying (2). 

Proof. We have F(E) C E. Since F is completely continuous, then by 
Shauder's fixed point Theorem, we conclude that the operator F has a fixed 
point v c satisfying 

v c + Kkffi(v c )v c + Kk^f 2 (v c )v c = c. 

Hence 

v c + Kkxfityjvc + Kk 2 f 2 (v c )v c = c - Kk^f x (v c )v c - Kk^ f 2 (v c )v c . 

Thus v c is a subsolution of (1). Since k\ and k 2 are Green-tight, then 

lim iir(|A; 1 / 2 ( U > c |)(x) = || ]im K(\k 2 f 2 (v c )v c \)(x) = 

]|a; || — s-oo ||x||— »oo 

and therefore lim|| ;r .||^ 00 f c (x) = c. ■ 

Now, we are ready to prove Theorem 3.1. 
Proof of Theorem 3.1 

Since c G P, then v o = c is a sursolution of (1) and satisfies (2). Let 
v c as in Lemma 3.3, then v c is a subsolution of (1) which satisfies (2) and 
v c < fo- Thus, by Proposition 3.1, there exits a solution u c of (1) such that 
v c <u c < v , obviously lim|| x ||_ >00 w c (a;) = c. 

4 Existence of maximal solution ut 

For this section, we assume that the following conditions hold: 

(-^i) k 2 > on M d and locally bounded, 
(-^2) /1 and / 2 are derivable on ]0, 00 [ , 
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(F 3 )\xm y -> 00 &T(y) = +00. 

71 
There exists a > such that : 

{F^jfi is increasing on ]o, 00 [ and linij / _ >00 /i(y) = 00 

(F 5 ) There exist S > and q > 1 such that /2(y) > Sy q-1 , for all y > a. 

Example 4.1 PFe can choose f\ and f 2 as follows: 
/1 = P, where P is a polynome of degree p > 1. 
/i(y) =y p log(y), p> 1. 

/2 = Q, where Q is a polynome of degree q > p. 
f 2 (y) = H(y)exp(\y), A > 0, where H is a polynome. 

Remark 4.1 For every x,y > 0, there exists c XjV G ]x, y[ such that 



Mx) - My) = (h) (cx,„) 
/iW-/i(y) (/i)'(c^)' 

T/ins, fry the assumptions given above, we get that lim&(y) = +00. 

/1 



In this section, we shall prove the following results: 

Theorem 4.1 We suppose that there exist a sequence of bounded smooth 
domains {Qi} i( - N , a sequence of positive real (S^^jn and T > a such that 

1. R d = UZi^i with Wi C Q i+ i, i = 1,2, ..., 

2. k To > on R d , 

3. For each i, k 2 > Si on a neighberhood of OVti , 

4- For each i, there exists 1 > pi > such that k± + (1 — Pi)k 2 To > on 
a neighberhood of dQi, 

5. There exists a positive solution v of (1) such that v > mo, where mo > 
T satisfies Wyrl > T and ^ > T for all y >m . 
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Then, the function 

U mo = sup {u(x) : u is a positive solution of (1) with u > itlq} 
is well defined and is a solution of (1). 
Corollary 4.1 We suppose that 

1. k\ and k 2 are continuous, 

2. There exist T > a and a sequence (r n ) n — ► oo such that k.T ( x ) > 
and k 2 (x) > for all \\x\\ = r n , 

3. k To > on R d , 

4- There exists a positive solution v of (1) such that v > m , where m > 
T satisfies |&^ > T and ^||| > T for all y>m . 

Then, the function 

U mo = sup {u(x) : u is a positive solution of (1) with u > itlq} 
is well defined and is a solution of (1). 

Proof. M d = U°^ -8(0, Tj). Since k\ and k 2 are continuous, then for each 
i, there exist 5i > and 1 > pi > such that kj^o > Si and Pik 2 T < <5j on a 
neighberhood of -8(0, Tj). Thus k\ + (1 — p^k^T^ > on a neighberhood of 
-6(0, Tj). The conclusion follows by Theorem 4.1. 

■ 

In what follows we shall prove some preliminary results which will be 
needed for the proof of Theorem 4.1. 

Proposition 4.1 Let^ > 0, q > 1. Letx G lR d , U = Ix G lR d : \\x — x \\ < R\ 

for R > 0. Then there exists a function u such that 

Lu - 7« 9 < inU (8) 

and 

lim-u(x) = oo, for all z G dU. 
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Moreover 

u(x) = \(R 2 — \\x — xo\\ )« zrT 

3 

where A = C(l V R)"- 1 for a positive constant C which depends only on 0,7, 
the dimention d and the upper bound for a,ij and hi . 

— 2 

Proof. Let u = X(R 2 — r 2 )^ 1 for some A > 0, where r = \\x — rr ||. By 
a direct computation we get: 

L w - 7 ^ = A(i? 2 -r 2 )^( Cl ^o i ^^ j H-c 2 (i? 2 -r 2 )(^a ii H-^& i ^)H- C (i? 2 -r 2 )^- 7 A«- 1 ) 

(9) 
Where Zi = Xi — Xo,i, Ci = 8(q + l)(q — 1) 1 , c 2 = 4(g — 1) 2 . Let A and 
B be the upper bounds J2 \o>u\ and I] |0j| • Then (9) implies (8) if 

Cl Ar 2 + c 2 A(R 2 - r 2 ) + Bc 2 R 3 - 7A 9 " 1 < (10) 

for all < r < R. The condition (10) holds if 

xq _, > (d + c 2 )AR 2 + Bc 2 R 3 

7 

which is true for A = C(l V R)^ , where C = ( (^^^^ ft , ■ 

Notation 4.1 Lei f2 &e an open subset of M d and f be a measurable function 
on M d , we denote by 

KL= I G&;y)f(y)dy 

Jn 

where G^ denotes the Green function in Q. 

Proposition 4.2 Let 5,7 > and u, v such that Lu — / ~fu q < and Lv — 

/ -fv q > in Q. If lim inf x ^ z u(x) > limsup ;r ^ z n(a;) for every z G dQ, then 
u > v on Q. 

Proof. Let s = u — v. Then liminf x _ +2 s(x) > for every z G dVt and 
Ls — 7( M Z v ) s < 0- Using the minimum Principle, we obtain u — v > on 
Q. m 

For the next, we consider the problem 
, , J Lu — kifi (u)u — k 2 f 2 (u)u = in f2 
m ' ) \im x _> y u(x) = m, for all y G dil 
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Definition 4.1 We say that u is a subsolution for (E m ) if 

j Lu — kifi(u)u — k 2 f2(u)u > m Q 
1 u < m on dil. 

We say that u is a sursolution for (E m ) if 

j Lu — kifi(u)u — k 2 f2(u)u < in $1 
1 u > m on dVL. 

Proposition 4.3 We suppose that there exists a subsolution u and a surso- 
lution v for (E m ) such that u < v , then there exists a solution w of (E m ) 
such that u < w < v. 

Proof. There exists s, s' E S b (fl) such that v + K k ^ h(v) ~ k2h(v \v) = m + s 
and u + Kq (u) = m — s' . Since o>mon dVt and u<mon <9f2, 

we conclude that s > and s' < 0. 

Let c = Max(||w|| , \\v\\) and a c satsfiying (A). For each u G C^(M d ), we 
set 

F(u) = -ki(x)fi(u)u - k 2 (x)f 2 (u)u + a c (|^2(^)| + \h(x)\)u 

and 

We define an operator T on C^(R d ) as follows: T(u) = (I+K)- 1 (K F ^+m). 
Hence, as in the proof of the Proposition 3.1, the operator T has a fixed point 
u < w < v which is solution of (E m ). m 

Lemma 4.1 Let Q be a smooth bounded domain. Suppose that there exists 
Tq > a such that kx (x) > on Vt. Let tuq > To such that jy4 > To and 

§£>T for ally >m . 

If the equation (1) possesses a positive solution v > mo, then, for each 
m > mi = max{t)(i) : x G dQ}, the problem (E m ) has a solution w m > v 
on Q. Moreover the sequence (w m ) m > mi is increasing. 

Proof. Let m > m&x{v(x) : x G dQ}. It is obvious that v is a subso- 
lution of (E m ). On the other hand, since k m > for m > mi, then m is a 
sursolution of (E m ). Thus, we conclude by Proposition 4.3 that there exists 
a solution w m of (E m ) such that v < w m < m. 
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Now, by setting z = w m+ \ — w m , we get 

{Lz — hz= in f2 
z = 1 on d£l 

where h = (h + fc 2 ( f ^\~l ^ m \ )) 7l{w ^~l {Wm) . Since w m+1 ,w m > m , 

fl{Wm + l)-fl(W m ) W m + 1 ~W m ^ 

then h > on Q and by the minimum principle, we get z > on Q. m 

Proposition 4.4 Let To and ttlq > Tq as in Lemma 4-1- Suppose that there 
exists a sequence of bounded smooth domains {Qi} i( - N and a sequence of 
positive real (<5j)ieAr such that: 

1. M d = U^fi; with Hi C Q i+1 , i = l,2, ..., 

2. k To > on R d , 

3. For each i, k 2 > Si > on a neighberhood of dfii , 

4- For each i, there exists 1 > p^ > such that k± + (1 — pi)k 2 T > on 
a neighberhood of dQi, 

5. Equation (1) possesses a positive solution v satisfying v(x) > m for 
all x E R d . 

Then, there exists a positive solution Ui satisfying 

{ Lui - kifi{ui)ui - k 2 f2{ui)ui = mtti 
[ \im. x ^ y Ui{x) = oo , for all y G dili. 

Furthermore Ui > Ui + \ > itlq on Qi for each i. 

Proof. Since dfli is smooth, we can use (x, tv(x)),x G dVti, \t\ < e, for 
some 1 > e > 0, as local coordinates for a neigberhood of dQi, where u(x) is 
the outward normal of OVti at x G <9f2j. From the assumptions 3 and 4, we 
can choose e > so small that 

f k 1 (x,tu(x)) + (1 - pi)k2(x,tu(x))T > 
[ k 2 (x,tv(x)) >5i>0 

for all — e < t < and for all x G dVti. Now, fix x G <9f2j, — e < t < 
t' < and let y = (x,t'u(x)). Put U y = lz G lR d : \\z - y\\ < R,\ where 
i? = inf (-£',£' -t). 
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For each i, let (w m ) m be the sequence defined in Lemma 4.1. Then, on 



U y we have 



Lw m = fafi{w m )w m + k 2 h{w m )w r , 



J m 

> (fa + (1 - pi)k 2 T )f l (w m )w m + pik 2 f 2 (w m )w. 

> piSSiW^. 



[fa + (1 - Pi)k 2 x 2 , m [ )fi{w m )w m + pik 2 f 2 (w m )w n 



By Proposition 4.1, there exists a suitible A such that the function 

u(z) = \(R 2 - \\z-y\\ 2 )^ 

satisfies 

Lu — p55iU q < on U y . 

Consequently, by Proposition 4.2 we get 

w m {z) < u(z), for all z G U y . 
Particularly, for z = y, we get 

But, from hypothesis 2, we have that w m is a subharmonic function on M d , 
which yields 

where Vt\ = Qi — {(x, t'u(x)),x G dQi, t < t' < 0}. Hence, for every — e <t< 
0, (u; m ) m > mi is increasing and uniformly bounded in Vt\. Thus (w m ) m > mi 
converges uniformly in Vt\ to a function Ui which satisfies 

Lui - kifi(ui)Ui - k 2 f 2 (ui)ui = 0. 

Now, let z = Ui — Ui+i. Then z satisfies 

Lz — hz = on Qi, 

where /i = (fa + fc 2 ( ^("-)-^("*+i) )) /i("0-7iK+i) > . We can choose e > 

small enough to get z > on <9fi* for — e < t < 0. By using the minimum 
principle, we get 

z > on dVt\ for all e < t < 
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and hence Ui > u i+ ion Qj. ■ 

Next, we give The proof of Theorm 4.1. 

Proof of Theorm 4.1. 

Consider the sequence of functions (-Uj)j defined in Proposition 4.4. Then, 
for each i, Ui > v > mo in f2j. Thus, (-Uj)j converges to a function u such that 
u > rriQ and u is a solution of (1). On the other hand, if u is a solution of 
(1) such that u > m , then, u < u^ on f2j. Thus, w < u on iR d . This proves 
that u = U mo . 



5 Asymptotic behaviour of the maximal so- 
lution 

In this section, we keep the assumptions given in Section 4. We will study 
the asymptotic behaviour of U mo . We shall prove the following results. 

Theorem 5.1 Suppose that there exist T > a and 1 > p > such that 

1. h + (1 - p)k 2 T > on R d , 

2. There exists a positive solution v of (1) such that v > m , where m > 
To satisfies j¥\ > To for all y > mo- 

3. /c 2 (:r) = \x\ v near oo for some v > 2. 
Then, the maximal solution U mo is such that 

1- U mo (x) = 0(\x\#) when E N ^ 0. 

i=i 

£. C/ mo (x) = 0(|x|^) w/ien E \h\ = 0. 

i=i 

Theorem 5.2 VKe suppose that L = A and that the assumptions 1, 2 and 3 
m Theorem 5.1 are satisfied. Moreover, we assume that there exists a > 
such that k\ < ak^To- If fiiy) = y q ~ x , then, the maximal solution U mo is 
such that 
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2 ' 



U mo (x)^(\x\^). 

Such a result was proved for radial solutions in [4] when k\ = 0, f 2 (y) = 
y^ 1 and in [3] when k\ < and f 2 (y) = V q ~ x ■ 

Proof of Theorm 5.1. 

Let C > 0, R > 1 such that k 2 (r) > Cr~ v for all r e [R , +oo[. We fix 
a; G -R d such that ||x|| > 2R and we set U = B(x, R) where R 
Hence, if v is a solution of (1) such that v > m , we get 

Lv = hf 1 (v)v + k 2 f 2 (v)v 

= hfi(v)v + (1 - p)k 2 f 2 (v)v + pk 2 f 2 (v)v 

= j 1 ( V )(k 1 + (i-p)k 2 iM)+pkj 2 (v) 

AW 

> P 5k 2 v q . 
Thus, for each y E U, we have i?o < ||y|| < 3/2 and 

Lv(y) > p5C\\y\\' u v q (y) 
> ^(y). 

By Proposition 4.1, the function 

M (t/) = A(i? 2 -||x-i/|| 2 )^ 

satisfies 

f Lu- 0§;u q < 0, on U 

\ lim y _ z u(y) = oo,\/z e dU 

A - 3 pic R ■ 

Thus, if £ |6,| ^ 0, we take A = Ai = (3 u (ci+C2} £ +Bc2 )^R^ 
and if £ |^| = 0, we take A = A 2 = (3^ (ci +g )A )^i2^ . 



for 
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In bouth cases, we have 

v(y) < u(y),Vye U 

and consequently, we get 

v(x) < \(R 2 )^. 

Hence, for A = Ai, we obtain that 



1/-1 



v ( x ) ^ ( 3 2 »-ip6C } " X " 



which yields 

For A = A2, we obtain that 



U mc =0(\\x\\ft). 



»W<(3'|^)-I# 



which yields 



U mo =0(\\x\\^). 



Proof of Theorm 5.2. 



We supoose that f2(y) = y q x and L = A. Thus, by Theorem 2 in [4], 
the equation 

Aw - (1 + a)k 2 u q = (13) 

has a maximal solution U such that U(x) = \x\1- 1 near 00. Now, let Mj be 
the solution of (12), then by setting s = Ui — U, we get 

As - (1 + a )A; 2 ^— p s = (k, - akjj^-)!^) < 0. (14) 

u i~U fi(ui) 

Since s > on a neigberhood of <9f2j, we get by the maximum principle that 
Ui > U on Qj. which yields that C/ mo > [/ on fij for all i. Thus by Theorem 

5.1 we get U mo (x) = \x\ 1- 1 near 00. 
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Abstract 

The functions which are continuous and change their sign finitely many times 
on the segment [a, b] are considered. For such functions all cases are investigated, 
when the analogs of uniform and pointwisc Jackson type estimates are valid for the 
approximation of function by copositive with it polynomial. 
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1 Introduction 

Let I := [-1, 1], and for s £ N let Y := {yjf =1 , - 1 = y s +i < • • • < Vi < Vo = 1; Y 8 - 
the set of all such collections. Finally let A^°'{Y) be the set of continuous functions / on 
/ such that / is non-negative on [yj,j/j_i], when i is odd and is non-positive, when i is 
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even, and set 

s 
U(x):=U(x,Y):=H(x- yi ). 

i=l 

Note that / G A'°)(Y) if and only if /(x)II(x) ^ 0, x G I. A function g is said to be 
copositive with / G A<W(T) if g{x)f{x) ^ 0, iGl 
The following notations are used {k G N; r, n G No): 

C:=C(/), 

ll/H :=max||/(x)|:xG/}, 
C (r) . = |j . y(r) g ^ 

V n — the space of algebraic polynomials of degree ^ n, 

£„(/):= inf{||/ -P n \\: P n eV n }, 

4°)(/, F) := inf{||/ - P n \\ : P n eV n n A<-°\Y)}, 



1 \/l — x 2 
Pn(x) := -J H ;r — , x£l (po(x) := 1, briefly p n (x) =: p), 



n^ n 



^fc(/; i) — the ordinary fc-th modulus of smoothness of function / G C. 

The main result of this paper is the following theorems. 

Theorem 1. Let k = 1, or r = s — 1, k = 2, or r ^ s, and Y G Y s . If f £ 
C' r ) n A(°'(Y), then for every n ^ k + r — 1 i/ie polynomial P n G Pn ("I A' '(V) exists such 
that 

|/(x)-P„(x)| < C(k,r,s)p r n (x)u k (f^; Pn (x)). 

Theorem 2. Let r ^ 1, or r = 0, jfe < 3, and 7eY s . // / G C^ n A^(Y), then 
for every n ^ N(Y) ^ A; + r — 1 the polynomial P n G P n n A^'iY) exists such that 

|/(x)-P„(x)| < C(fc,r,5)p;(xK(/W;p„(x)), 

and /or every n ^ k + r — 1 the polynomial P n G V n n A' **(Y) exists such that 

|/(x) - P n (x)| < C(fe, r, yK(xK(/M; p n (x)). 
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If we compare Theorems 1 and 2 and Counterexamples from Section 5 (which are valid 
in the uniform case) we are possible to make some conclusions in graphical form (they 
are the same in uniform and pointwise case, see Tables I-IV below). In tables I— IV entry 
+ means that at the corresponding k, r, s the estimate for / G A^°>(Y) and for some 
polynomial P n <= A^ (Y) 



\f(x)-P n (x)\ ^ Cp r n (x)u; k (f^;p n (x)), n^N 



XI) 



is valid for C = C(k,r,s) and N = k + r — 1. Entry © means that (1.1) is valid for 
C = C(k, r, s) and N = N(Y) ^ k + r — 1 and vice versa for C = C(k, r, Y) and 
N = k + r — 1, and it cannot be obtained with C = C(k, r, s) and N = k + r — 1. And 
finally entry — means that (1.1) isn't valid even with the constants which depend on /. 

Tables of Truth 

I. The case s = 1 



3 + + + + + • 

2 + + + + + • 

1 + + + + + • 

+ + © 

12 3 4 5 k 



II. The case s 



3 + + + + + • 

2 + + + + + • 

1 + + © © © • 

+ © © 

12 3 4 5 k 



III. The case s = 3 



r : : : : : 

3 + + + + + • 

2 + + © © © • 

1 + © © © © • 

+ © © 

12 3 4 5 k 



IV. The case s > 3 
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r 






s 


+ 


+ 


s-1 


+ 


+ 


8-2 


+ 


© 


1 


+ 


© 





+ 


© 




1 


2 



+ + + 



k 

Let's give a short history of the problem. The first estimate in copositive approxima- 
tion of Jackson type was obtained by D.Leviatan[5]. The estimate (1.1) was established 
with at least one constant depending on Y for r = 0, k ^ 3 by Y.K.Hu, K.A.Kopotun and 
X.M.Yu[3] and for r ^ 1 by G.A.Dzyubenkofl] and Y.K.Hu, K.A.Kopotun and X.M.Yu[3] 
(as the consequence of the result in so-called intertwining approximation). A counterex- 
ample for r = 0, k ^ 4 and s = 1 was obtained by S.P.Zhou [8]. For more detailed survey 
one can address to the work by Y.K.Hu and X.M.Yu[4]. 

We will use the Chebyshev partition of [—1,1], namely, let x~i := 1, x^+i := —1 
and for each j = 0,...,n, set Xj := Xj ;n := cos(J7r/n), Ij := [xj,Xj-i], and hj := \Ij\ : = 
Xj — Xj-\. The following inequalities take place: 

p n {x) < hj < 5p n (x), x £ Ij, j = l,...,n (1.2) 

hj±i < 3hj, j = l,...,n, (1.3) 

P 2 n(y) <±p n (x)(\x-y\ + p n (x)), x,y€l, (1.4) 

which implies 

2(\x-y\ + p n (x)) >\x-y\ + p n (y), x,y € I. (1.5) 

Let's set for each j = 1, n 

h 3 .. r„\ .. Pn( x ) 



JK ' \x-Xj\+hj JK ' dist(x,Ij) + p n (x) 



where dist(x,E) = inf{|x — y\ : y € E). 
We make use of some other inequalities: 



For a given Y, let 



and set 



J2hji](x)<20p n (x), (1.6) 

3=1 

7?(x) < 16rj-(x), rj(x) < 400 7i (x). (1.7) 

Oi := (ajj+i,a;_ 7 -_2), if 2/i € [xj,Xj-{), 



0:={jO t . 
i=l 

Next let O g := [a g , 6g], g ^ s, be the connected components of the closure O of O, indexed 
so that 6g+i < cig. We will write j 6 iJ, if Ij n O = 0, j = 1, n. 
The next inequality is easily verified: 



n(x) 



^ l^^ + lj ■ -re /. /ye/\0. (1.8) 
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2 Auxiliary lemmas 

Let if G $ fc , i.e., <^(0+) = and ip(t) is nondecreasing while t~ k (p(t) is nonincreasing on 
(0, oo). If function / belongs to C^ r ' and u>k(f ] ") ^ <Pi we w iU denote this by / G W r H^. 
It is assumed in the paper that / G W r H^ n A^°'(Y) for some k, r and Y G Y s . We will 
stand c, Cj, z G N for different constants, which depend only on k, r, s (but not Y). 

First we recall one known statement, which we need. It was proved by J.Gilewicz and 
I.A.Shevchuk[2]. 

Statement 1. Let s G N, Y G Y S; k G N, (r + 1) G N, [a, b] C /, h := b - a, g G 
W r H^n A(°'(Y). // i) fc = 1 or ii) s = r + 1, k = 2 or hi) s ^ r, i/ten suc/i a polynomial 
Pk+r-i(x) '■= Pk+r-i(x, g, [a, b], k, r) of degree ^ k + r — 1 exists i/iai /or ewerj/ x G [a, 6] 

|g(x) -Pjk +r ._i(x)| < c*(k,r)h r <p(h), 

and 

P k+r - 1 (x)U(x) ^ 0, xG[a,6], 

where c*{k,r) = const depends only on k and r. 

Let's denote by 27& the collection of piecewise polynomials (not necessarily continuous) 
of degree ^ k with the knots at the Xj's, and denote by T, k o the subset of S G Xfc which 
are polynomials on each O q . When we use the notation S{xj) we mean either S{xj+) or 
S(xj-), when x tends to Xj from the right and from the left respectively. This makes no 
difference in our arguments. Note that all derivatives of the piecewise polynomial (spline) 
exist except perhaps at the Chebyshev nodes. When we will use them for Chebyshev 
nodes, we will make corresponding reservations. 

Lemma 1. If f G C[a, b] and f{x) ^ for x G [a, b], then a polynomial Pk-i of degree 
^ k — 1 exists such that 

\\f-Pk-i\\ [aM ^cu k (f;h;[a,b]), (2.1) 

and 

P fc _i(x)^0, x€[a,b], (2.2) 

where h = b — a. 

Proof. Let Pt-y be the polynomial of the best uniform approximation of / in [a,b], 
and 

Ek ■= ll/-- P fe-lll[a,6]- 

Let's set 

P k _ 1 (x):=P^_ 1 + E k . 

Inequality (2.2) is evident and 

\\f ~ P k-l\\[ a ,b] ^ 2E k- 

Then (2.1) follows from Whitney's inequality. □ 

An obvious consequence of Statement 1, Lemma 1 and (1.3) is following 

Lemma 2. Let r ^ s, or s = r + 1, k = 2, or k = 1. Then there exists S G 
Sjk+r-1,0 n A(°)(Y) such that 

H/-5L. ^chMf^hhj), j = l,...,n. (2.3) 
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Lemma 2 can be formulated also for many other k, r, s, provided we take n > N(Y), 
where N(Y) is taken so that for each n > N(Y) every O q contains exactly one point 
y q G Y. So we have 

Lemma 3. Let r ^ 1, or r = 0, k = 1,2 and suppose that n > N(Y). Then a piecewise 
polynomial S G Tj^+r-i^o H A'™(Y) exists such that (2.3) holds. 

Copositive piecewise polynomial, which gives proper estimate, can be constructed also 
when r = 0, k = 3. For this case we have 

Lemma 4. Let r = 0, k = 3 and n > Ni(Y). There is an S G ^2,0 H A'°'(Y) such 
that 

||/-5|| J .<ou;3(/;M J 3 = h-,n. (2.4) 

Proof. We choose iVi(Y) so that for all n > N±(Y) holds: if for some i, j yi G Ij then 
Ij-i nY = Ij + i n Y = 0. When for certain i,j yi G Ij let Pj(x) := L(x; /; Xj+i, yi,Xj-2), 
where L is a quadratic Lagrange polynomial interpolating the function / at the points 
Xj+i,yi,Xj-2- For these i, j it satisfies 

||/-i 5 J -|| .<c W3 (|O i |) J (2-5) 

II " I I \_s % 

and 

Tj(x)n(x) > 0, x G Oj. (2.6) 

Indeed, (2.6) is evident, and (2.5) follows, for example, from [7], Theorem 4.2. Thus 
Lemma 4 follows from (2.5), (2.6), (1.3) and Lemma 1. □ 

Now let Iij be the smallest interval containing Ii and Ij and denote hij := \Iij\- 
Following [6], for S G Sfe-i put 



llPi-Pillj. f fcj 



Oi J = o ij -(g,y)== y(M ' (^-j , *,j = l,n, (2.7) 

where p« is the polynomial defined by Pi\i { '■= S^- Finally let 

a k := a k (S,ip) := max aij(S,(p), 

The next Statement 2 which we need was proved by D.Leviatan and I.A.Shevchuk [6] 
(Lemma 6). 

Statement 2. There is a constant c, depending only on k, such that for any f G H? 
and S G £fc_i, if 

11/-%. <<p(hj), j = l,...,n, (2.8) 

then 

a k ^ c. (2.9) 

We need one statement for the partition of unity, proved in already mentioned paper 
by J.Gilewicz and I.A.Shevchuk [2]. 

Statement 3. For each fixed integer I there exists a collection {Tj^ n } T - =1 of polyno- 
mials of degree ^ cln with the following properties: 

n 

Y,rj,n{x) = l, (2.10) 

i=i 

iff(*) ^^f l "ftlwJ m * G/ > A = 0,l,2,..., (2.11) 



ON THE DEGREE OF COPOSITIVE APPROXIMATION 227 



where C = C(s, I, A), and 

Tj,n(yi) = 0, Vi, j such that m £ O q and Ij (jL O q . (2-12) 

We have to prove one more lemma. 

Lemma 5. Let I ^ 3k — 1 and assume that S £ ^k-io an d S(yi) = 0, i = l,...,s. 
Then the polynomial 

n 

D n {x) := y~]pi(x)T ijn (x), (2.13) 

i=l 

satisfies 

D n ( yi ) = 0, 1 < i < a, (2.14) 

and for each A = 0,..., s, 

SW( x )- D W(x) ^C a k (S)^f-, x£l, (2.15) 

Pn\ x ) 

where Co = Cq(s, k, I). 

Remark 1. By S^ x \xj), j = l,...,n we mean either S^'(xj+) or S^'(xj-) and 

inequality (2.15) is considered for x — ► Xj+ or x — > Xj— respectively. It doesn't matter 
what derivative we choose. This fact can be seen from the proof of Lemma 8. 

Proof. Evidently (2.14) follows by (2.12), where we use the fact that S is a single 
polynomial in each connected component of O. Further we fix 1 ^ v ^ n, and x £ I v . 
Since p u — pi is a polynomial of degree not exceeding k — 1 , then 

\\Pv-Pi\\l v <c(-j^-J \\Pu-Pi\\i.- 



Hence by (1.7) and (2.7), 



\Pv~Pi\\l v <c{ -j^-j {-^)v(h u )ai, u 



3fc-2 



< ca i:V ip(h v )[ -jf- ) =: cfii,, 



(2.16) 



Now by (2.10), 



S(x) - D n {x) = ^2(p v (x) -pi{x))Ti{x) =: y~]ai(x), (2.17) 



and by virtue of (2.11) and (2.16), 



\oii{x)\ ^ c(l,k)cH jV hi(p(hu)' ''" 



Hence 



&„/ (p+lx-Xil)^ 1 

/ \ / \ i+3— 3fc 

< c(/, fc)o fe /i, ( — n 

p \p+\X-Xi 



\S(x) - D n (x)\ ^ Y, l«^ x )l 

/+3-3fc 
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Finally by means of (1.6) 



<f(p) 



\S(x) - D n (x)\ ^ c(l, k)a k ^!^Y. h 



i=l 



< c(l,k)a k <p(p), 



what is (2.15) in the case A = 0. If < A ^ s and x G I u , then with the help of 
representation (see (2.17)) 



n A 



i=l,i^vq=0 ^' 

taking into account (2.11) and observing that it follows by (2.16) that 



„(<?) _ Si) 



C 

i„ ^ hi 



^ ^q S'i,j/) 



we can obtain (2.15) for A > by repeating the previous arguments. This concludes the 
proof. Note that we used no continuity of the spline S in the proof. □ 



(3.1) 



3 Main lemmas 

For every j = 1 , n we set 

x o . = x o ^ f cos (U ~ l/4)vr/n), if j < n/2, 
Xj " Xi ' n '""\cos((j-3/4)7r/n), if j> n/2; 
Xj := x jt n := cos (j - l/2)ir/n; 
tj(x) := tj :n (x) := (x — Xj) cos 2 2narccosx + (x — Xj) sin 2 2narccosx 

the polynomial of degree 4n — 2; and for j G H, b € N denote 

dj := d jn (b,Y) := mmt b Ax) \U(x)\ ; 
xeij 

t b Ax)U(x) 
Tj(x) := T jtn (x,b,Y) := 3 d ~ 



the polynomial of degree 6(4n — 2) + s. Obviously for j G H dj > 0, so the definition is 
correct. 



Lemma 6. Let j £ H and b eN, b ^ ^, if e $ k . Then 



chj M \U( Xj )\ ^ dj ^ Chj 2b |II( 
Tj(x)U(x) > 0, x € J, 



'j;i ' 



<£>(/ij)|Tj(x)| ^ <^(p), x G /,-, 

4fe+fc 



V(^i)l T i(^)l > <P(P) 



P 



\X — Xj\+ p 



n(x) 



nfx,- 



x e J, 



¥>(/ij)l r j(aOI < C( ^(p) 



|X — Xj\ + p 



x e I. 



(3.2) 
(3.3) 
(3.4) 

(3.5) 
(3.6) 
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Proof. For dj we have by (1.8) (see also [7], Proof of Lemma 17.2) 
t b Ax) |II(x)| > cmin{(x - x°)~ 26 , (x - Xj)~ 2b } \U( Xj )\ ^ chT 2b \U( Xj )\ , i£l 3 



3 

It is known that 

So the above estimate of dj is 

4-b 



tj(x) < 312/tJ 2 , x € /,-. 



.7 

*J(x) |n(x)| < Chj' M \U( Xj )\ , xElj. 



.-2b I 



Also we have 








_J ! ! 1 


><c( 


/ 


1 


\ (x — X 1 -) 2 ' (x — Xj) 2 t 


Jx- 


- Xj \ + ft j 


and 








_,..( 1 1 \ 


■ ^ c( 




1 


\ (x — X^) 2 ' (x — Xj) 2 J 


x - 


- XJ + ftj 


And for the polynomial Tj we obtain 









, X £ Ij, 



X & Ij. 



if(hj) \Tj(x)\ ^ c(p{p), x e /,-; 
taking into account the estimates of dj, (3.7) and (1.2)— (1.8), for x £ I 



ip(hj) \Tj(x)\ < cy>(fcjO 



^ ap(hj 



h; 



\Jb Jb n\ | / t"i 



LX 1 *^ 7 I ' ("i 



26 



n(x) 



n(x,) 



26-s 



< c^f yM|x-Xj| + p) 



I _ I _l_ \ k l 2 



< cl p(p) 



< c v?(p) 



p 



h 3 

Jb Jb n \~ I L n 



26- 



26- s 



Jb Jb 1 | I L q 



26-s-fc 



P 



|X — Xj| + /9 

Applying (3.2), (3.8) and (1.2)-(1.8), for x I, 
<^) |T,(x)| > cy(M( [a ._^ 



26 



n(x) 



^ C(/3 



P 



n(xj 



> 



C<£ 



,2 \ / l \ 26 



26 



n(x) 



p 



x — Xj\ + p J \\x — Xj\ + h 



h .i 



U(xj 
U(x) 



> CL P{p) 



> c f(p) 



x — Xj\+pj \\x — Xj\ + h 



hi 



U( Xj ) 

26 



U(x) 



U(xi 



P 



X — Xj\ + p 



46+fc 



U(x) 



U(xi 



(3.7) 



(3.8) 



Now the proof is complete. 



□ 
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In the next lemma we construct correcting polynomials. 

Lemma 7. For each ip G <£ , there exists a polynomial Q n = Q n (x,ip) of degree 
^ (2s + 2k + 16)n, satisfying 

Q n (x)U(x) > 0, x G J; (3.9) 

Q n (x)sgnII(x) ^cp(p), x€l\0; (3.10) 

|Q n (x)| < cMp), xel. (3.11) 

Proof We take a polynomial Tj{x) = Tj(x, b, Y), denned above the Lemma 6, where 
6=[*±*]+4. Put 

Q n (x) := ^ (p(hj)Tj(x). 

j&H 

Now (3.9) follows from (3.3). Because at the fixed x all Tj{x) have the same sign, (3.10) 
follows from (3.4). We have also by (3.6) 

tp(hj) \Tj(x)\ < c^f ^-— ) . 

\|x -Xj| + pj 

Hence we obtain at last with the help of (1.2), (1.5) and (1.6) 

n 

\Qn(x)\ ^cV^(p) 

^-^ \\X — Xj\ + p 

2 



. 3 
P 



^Y!#jf ' i r-— ) <ci<p(p), 



that is (3.11). D 

4 Proofs of the positive results 

Lemma 8. Let ^6$ fc and S G £fc_i ; o- Assume that 

a k (S,(p) < c 2 , (4.1) 

and 

S(x)II(x) > 0, x G / (4.2) 

Tnen £/iere is a polynomial P n of degree ^ en suc/i t/iat 

|P n (x)-5(x)|^C^(p n (x)), (4.3) 

and 

F n (x)n(x) > 0, x G I. (4.4) 

Proof. We begin with the polynomial Z) n of Lemma 5, associated with the spline S 
(we take I = 3k — 1) of degree ^ en. We have 

\S(x) - D n (x)\ ^ C C2<p(p) (4.5) 

Now we proceed with the polynomial of degree ^ en 

R n := D„ + C c 2 Q n , (4.6) 
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where the polynomial Q n is defined in Lemma 7. By virtue of (3.11) and (4.5) we have 

\S(x) - Rn(x)\ < cip(p), xGl, 

and by virtue of (3.10), (4.2) and (4.5) 

Rn(x) sgall(x) ^ CoC2Q n {x) sgnLT(x) + 5(x)sgnn(x) 
-\S(x)-D n (x)\ >0, x€l\0. 

It remains to modify the polynomial R n so that it will be copositive with / also in O. For 
this we have deal only with D n , because by (3.9) copositivity takes place for Q n . So we 
take O q , a connected component of closure O of O and we let yi q+ \,...,yi q+u G O q (v ^ s). 
Also we put Xj q to be the closest to O q , such that Ij n O = 0. Then (2.14) and (2.15) 
together with (4.1) and (4.2) yield for i£0 ? 



\S(x)-D n (x)\ 



[\(x - yi q+fl )[x,y iq+1 ,...,y iq+u ;S - D n ] 



n=i 



1 



(iz + 1)! 

< Op(Pn(d))Pn"(0) 



H(x-y lq+ ,)(S^(0)-D^(9)) 



(4.7) 



n ( s ~ y^+M^ 



by the square brackets we denoted the divided difference of order v of the function S — D n 
and 9 £ O q . For the definition and necessary properties of divided differences one can see 
[7], ch.l. Hence 



\S(x) - D n (x)\ < ap(p n (xj q )) t-t^" 1 ! lq -^ 

LL=1 \ X j q ~ V 



^ c 3 ip(p n (x jq )) 



fj,= l \~Jq oiq+fl\ 

U(x,Y)\ 



(4.8) 



|n(^,F)| 



For the first inequality in (4.8) we used the fact that O q is connected and contains at most 
3s intervals, so that 



O, 



Pn{0) ~ Pn(Xj q ) ~ (Xj, - % q+M ), 1 < P < I/; 



and for the second inequality we used the above together with the fact that for any yi O q 



\x-Vi\ 



~> 



\x-yi\ 



> 



Pn(x) 



>c>0. 



\ X J,-Vi\ \ x ~ x j q \ + \ x ~ Vi\ \O q \+p n {x) 

Now we again make use of polynomials Tj from Lemma 6 (and again we take b = [^2 ] ^~^) ■ 
By (3.5) we have 



V(fyJ | T ig(^)| > C^{pn{Xj q ))- 



|n(x,y)| 
n(x, 9 ,y)| 



ieO„ 



(4.9) 



Thus if we set 



C3 



, 3q .-- —<p(h jq )T jq , 

C4 
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and 

D n :=D n + J2T Jq , 
q 

where the sum is taken over all connected components O q , then it follows by (4.8), (4.9), 
(3.3) and (4.2), that for x G O qo 

D n (x)U(x, Y) = (D n (x) - S(x))U(x, Y) + T jqQ (x)U(x, Y) 

+ J2 T jq (x)U(x,Y) + S(x)U(x,Y) ^ 0. 
i¥=qo 

Hence, for the polynomial 

r n : = ti n + y ^ 1 j q , 
<? 
we have 

P n (x)U(x,Y) ^ 0, x€l. 

Note that one can prove that 



£^(z) 



^ C5<p(p n (x)), X £ I 



as in the proof of Lemma 7. So we observe that 

\S(x) - P n {x)\ ^ CQ(p(p n (x)), x£l. 
This concludes the proof. □ 

Proofs of Theorems 1 and 2. Theorem 1 for n^cj^k + r — 1 now follows from 
Lemmas 2 and 8 with the help of Statement 2. While for k + r — 1 ^ n < cj it follows from 
the Statement 1. Theorem 2 for n ^ N(Y) follows from Lemmas 3, 4, 8 and Statement 2, 
and for k + r — 1 ^ n < N(Y) it follows also from Statement 1. □ 

5 Counterexamples 

Example 1. For each s > 1, r < s — 1, A > 0, n £ N there exists a collection 
Y := Y(n, r, A, s) and function f := f n ^ A G C^ D A^(Y) such that 

EJ?\f,Y) > ^ 2 (/ (r) ;l) > A2- k+2 u; k (f^;l), k > 2. (5.1) 

Proof. We assume without loss of generality that n ^ r + 1. Let choose 6 £ (0, 1) 

from the condition 

1 6 r 

45 n 2(r+i) ~ 4( r + i)l ~ 

We fix arbitrary collection Y of points yi such that — 1 + 6 = yi > y2 >••• > 2/s > — 1- 
Following [2], set 



Q r+1 (x):=(x- yi ) r+1 ; 
f(x) := (x-yi) 



r+l ._ I Qr+l(x), if X^ -1 + 6, 

+ |0, if x< -1 + 6. 
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Evidently / G C (r) n A( )(y). We take an arbitrary polynomial P„eP„n A^iY). Let's 
denote 

Rn(x) := Q r +l(x) - P n {x). 

We consider divided difference 

[yi,...,y r +2;Rn\- 

Because P n G A^°'(Y), there is P n (yi) = 0, i = l,r + 2, so [yi,..., J/r+2! -fn] = 0. Besides, 
[yi,...,y r+2 ;Q r +i] = 1, that is, 

[yi,-,y r +2;Rn] = i- 

Therefore the point 0€ (-1,-1+6) exists such that 

i#+i)(0) = (r+l)!^,..,^;^] = (r + 1)!. 

Applying Markov's inequality we obtain 

(r + 1)! = 4 r+1 >(0) < n 2 ^ +1 ) ||i?„|| < n 2 (' +1 >(||/ - P n || + ||/ - Q r+1 ||) 

= n 2 ( r+1 >(||/-P„||+& r+1 ), 

whence 

II f P\\> (r + 1)! b r+1 

WJ ^11 > „2(r+l) ° • 
On the other hand , 

^ 2 (/ {r) ; 1) = W2(/W - Q&; 1) < 4 |/W - Q^l = 4(r + 1)!6 . 

Hence 

ll/-- P n|| 1 & r 

> ^TTT-T7 77 = A 



w 2 (/W;l) 46n 2 ( r + 1 ) 4(r + l)! 



D 



Example 2. Let r = s — 1. For each A > 0, n G N i/iere exists a collection 
Y := y(n, r, A) and function f := / n>rv4 G C^ n A(°)(y) suc/i i/tai 

4°)(/,y) > Aa; 3 (/ (r) ;l) > ^2- fe+3 ^(/M;l), k > 3. (5.2) 

Proof. We assume that n ^ r + 2. Let choose 6 G (0, 1) from the condition 

1 &r 

4(r + 2)6n 2 ( r+1 ) ~ 4(r + 2)! ~ 

We fix arbitrary collection Y of points yi such that — 1 + 6 = yi > y 2 > ... >y s > — 1. Set 

Q r+2 (a;) := (x - yi) r+2 ; 

10, if x < -1 + 6. 

Evidently / G C( r ) n A(°)(y). We take an arbitrary polynomial F n eP n n A(°)(y). Let's 
denote 

R n {x) := Q r +2{x) - P n {x). 
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We set y r +2 := — 1 and consider divided difference 

[yi,:.,y r +2;Rn\- 
Because P n G A^(Y), there is P n (yi) =0, i = l,r + 1, so 



[yi,-,y r +2;Pn] 



PJ-D 



> o. 



n(-i) 

Besides, [yi,--,y r +2\ Qr+2] ^ 0, and (see [7], Theorem 1.2) 

-[l/l,-, Vr+2] Qr+2] > (J/1 - ?/2) +••• + (Vr+l ~ Vr+2) = b 

that is, 

[yi,-,y r +2;Rn] < -&■ 

Therefore the point #G(— 1,-1 + 6) exists such that 

R { ; +1) (e)\ = (r + l)l\[ yi ,...,y r+2 ;Rn}\ > (r+l)!6. 
Applying Markov's inequality we obtain 

(r + 1)!6 < |i# +1) 0?)| < n 2 < r+1 ) ||i?„|| < n 2 ( r+1 )(||/ - P n \\ + ||/ - Q r+2 ||) 

= n 2(r + l) ( || / _p n || +6 r + 2 ) 

whence 



n 



2(r+l) 



On the other hand , 

u*(/M; 1) = a*(/M - gW 2 ; 1) < 8 |/W - Q^ 2 || = 4(r + 2)!6 2 . 
Hence 



ll/-^n| 

w 3 (/M;l) " 4(r + 2)6n 2 ( r + 1 ) 4(r + 2)! 



> 



A. 



D 



Example 3. For any collection Y S Y s t/iere is a function f = f(Y) G A^°'(Y) 
such that 



,. ^ 0) (/,y) 

hmsup — - - = 00, 

n^oo Uk(J,l/n) 



(5.3) 



w/ien /c > 4. 



Proof. Note that it is sufficient to prove (5.3) only for fc = 4, so we will deal with the 
fourth modulus. Denote 



IIi(z) := 



Htnix-Vi), 2, 

1, s = l. 



For 6 G [0, 1] we construct a function 5b such that 

S b (x) ■- 



I 1, \x\ > 26; 
\0, |x| < 6; 

0<5 6 (x)<l, 6<|x|<26; 5 s eC (4) . 
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For example for x £ [b,2b] one can set 

S b (x)= [ {u-bf{2b-ufdu[ I {u-b)"{2b-u)"du 



26 \ -J 

4/oJ, „.\4j„,[ / /„. i,\4/ol „.\4, 



and Sb(x) = Sb(—x), x £ [—2b, —b\. Put 

q b (x) := ((x - yi) 2 - b 2 )(x - y ± ), 

9b{x) := q b {x)S b {x - yi) , Q b (x) := q b (x) . 

1 1 Will ||1J-1 1 

For these functions there are following relations: 

g b (x) = 0, x e [yi - 6,j/i + 6], 

9b e c< 4 > n A(°)(y), || 56 || < ||Q 6 || < || 96 || < 10, 

\\9b- Qb\\ [yuyi+2b] ^66 3 , 
evidently < S = nyy < 1, therefore 

V4(g b ;t) < 16|| 56 -Q 6 ||+t 4 |b[ 4) || < 966 3 + 10i 4 (s + 2) 8 < c 7 (b 3 + t A ) 

For each P n eV n D A(°)(Y) we denote 

Rn(x) = P n {x) - Qb(x), 

and obtain 

P' n { yi ) > 0, R' n ( yi ) > -Q'b{yi) = Bb 2 . 



(5.4) 
(5.5) 
(5.6) 



(5.7) 



(5.8) 



Applying Bernstein's inequality (we assume n > s + 1), for < h ^ 1 — \yi\ from (5.6) 
and (5.8) 



Bhb 2 < hR' n ( yi ) ^n\\R n \\ [yi _ h:yi+h] < n(\\g b - Pn\\ [yi - h , yi+h ] + 66 3 



Hence 



hb 2 ,o 

\9 b -Pn\\ [yi - h , yi+h] >B—-6b\ 



n 



For n > s + 1 put 



and note that (5.7) implies 



b n = n 4/3 ; f n (x) = g bn (x) 



W4(/«;t) ^2c 7 t 4 , t> 1/n. 



(5.9) 
(5.10) 

(5.11) 



First we put e = 0.1 and choose uq so large that n^ > 4c-? and b no ^ — f^-. Set do := 1 



and 



di :- 



B 6n,-_i6j 



3-1 «j 



40 n, 



d,-_i 



<j-i, 



(5.12) 



where the increasing sequence {n u } ( ^ =1 is defined by induction as follows. Suppose that 
{no,..., n<j_i} have been defined, then put 



<T-1 



F <r _i:=£d i _i/ n . (F :=0) 
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and take n a > n a -\ so large that 

and 

We will put 



F, 



(i) 



cr-l 



< d a -W%, 



Bb na _ x > 12n- £ . 



*<x— 1 •> 



(5.13) 
(5.14) 
(5.15) 

(5.16) 



the uniform convergence of the series follows from (5.5) and the inequality 

°° / 1 1 \ 

Y^d^ l <d a Al + - + — 2 +...\<2d 

j=cr 

which follows from (5.12), (5.10) and < B < 1. Next we define 

oo 
/:=/(y):=X>_i/ n „ 

3 = 1 

and note that (5.5) yields that / £ A^°'(Y). Inequalities (5.11), (5.16) and Uq > Acy yield 

/ 1 \ 1 °° i i 

uA $ CT ; — I < 207-4^ d,-i < 4crd a -i—T < da-i—rz 

and (5.13) provides 



(5.17) 



J=o" 



uA i^-i; — 1 ^ 



n„ nl 



r a-l 






So for all <r 






(5.18) 



By (5.4) F CT _i is zero on [yi - b n<y _ 1 ,yi + &n CT _i] =: </<r, so we may write 

f(x) = d a -lfnA x ) + ®<r+l(x), x ^ d a . 

For each P n £ V n n A(°)(7) let p„ CT = l/d CT _iP„ CT . Since p„ CT £ A(°)(F), then by means of 
(5.9) 

||p / || >B h _J^L _ 66 3 

Mazier JfiaWJa- -^ n CT 

On the other hand (5.5), (5.16), and (5.12) yield 

oo 1 , ,2 

H^+ill < 10 V d,_i < 20d CT = -rf g _i£ rag ~ 1 " CT . 



j=<T+l 



/'n 



Hence 



11/ - -PriJI ^ 11/- ^nJ|j CT ^ ||-Pn CT ~ d<r-l/nJ|j CT 
= dff-1 lbn CT - /nJ|j CT - ||*«r+l|| 

(\ b n ,bl q 
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Now by (5.10) and (5.14) 

i/-iU>«~>(|&-4)=«~.((£) ,1/ '~-(£) 4 ). 

And by (5.18) 



11/ P nA ^ ^l/3-2e_ n -e\ 



OO, (T — ► OO, 



w4(/;l/n<r) 

which implies (5.3). □ 
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Abstract 

Let E p (G), 1 < p < oo, be the Smirnov class of analytic functions 
on a bounded domain G containing 0. We assume that the boundary T 
of G consists of N closed analytic Jordan curves. For any nonnegative 
integers n and m denote by MP n m the class of all meromorphic func- 
tions h on G that can be represented in the form h = a/(3z m , where 
a £ E p (G), f3 is a polynomial degree at most n, /3 ^ 0. We investigate 
the problem of approximation of a function / £ L p (T) by functions in 
the class MP n m . A theorem giving necessary conditions for a function 
belonging to the class M.^ m to be an element of best approximation is 
proved. As a consequence we show that the denominators of the best 
meromorphic approximants are orthogonal polynomials with respect 
to a weight that varies with n. 

AMS Classification: 41A20, 30E10, 30D50. 

Key words: Meromorphic approximation, Best approximation, Orthogonal 

polynomial. 

1 Necessary Conditions for an Element of Best 
Approximation 

Let G be a bounded iV-connected domain and let the boundary r of G 
consist of closed analytic Jordan curves. We assume that V is positively 
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oriented with respect to G and € G. Let 1 < p < oo. Given a pair (n, m) 
of nonnegative integers, we consider the following collection of meromorphic 
functions 

M p n>m = Ml tm {G) = {h = a/[3z m , a e E p (G),p £V n ,f3^0}, 

where E p {G) is the Smirnov class of analytic functions on G and V n is the 
class of algebraic polynomials of degree at most n. 

Denote by L p (T), 1 < p < oo, the Lebesgue space of functions with 
integrable pth power. For cp £ L P (T) the norm of (p in L p (T) is 



\<p\\p = (f r \<p(om\) 



i/p 



The Smirnov class E p (G), 1 < p < oo, of analytic functions on G consists 
of the functions <p for which there is a sequence of domains G& with rectifiable 
boundaries having the following properties 

Gk+i c Gk, Gk c G, [jGk = G 

k 

and 

sup/ \ V ($\P\dt\ < oo. 

k JdG k 

Here and in what follows we consider the Smirnov class E p {G) as a subspace 
of the space L P (T). The condition 

^^ = for all zeC\G (1) 

r £ - z 

is necessary and sufficient for a function y? G ^i(r) to be the boundary value 
of a function in the Smirnov class E\{G) (see [5] and [8] for more details 
about the classes E p (G)). 

Let / belong to the space L p (T), 1 < p < oo. The error in best approxi- 
mation of / in the space L p (T) by functions from the class Ai? n m is denoted 
bv A p • 

A£, m = A£ )ro (/;G)= inf ||/-%. (2) 

heM p n , m 

It is not difficult to prove that there exists a function h n , m £ A4^ m for 
which the infimum in (2) is attained: 



^■n.m ~ II J "-", 



m| |p 
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(see, for example, [6] for p = oo). 

We represent h n ^ m in the form h n , m = P/Qz m , where Q is a monic 
polynomial with all its zeros in G, P G E p {G) and P ^ at the zeros of Q. 

From the relation 

K, m = mf [llf-u/QCWp, (3) 

u£Ep(G) 

we can assert with the help of duality relations (see [9], [10]; for the case 
when G is the unit disk see [3] and [4]) that there exists a unique function 
<f>€E q (G), \\Q£ m <P\\ q = 1, l/p+ l/(z=l, such that 

QiocHOV - hn, m )(0dt = K, m \Qm m m\ q i#i w 

almost everywhere on V if 1 < p < oo, and 

Qiormv - KmXOdt = \(f - h n , m )(o\m (5) 

almost everywhere on T if p = 1. 

We now formulate a theorem giving necessary conditions for an element 
of Ai^ m to be an element of best approximation to the function / in the 
space L p (T) in the class M p nm . 

Theorem 1. Suppose that f G L p (T), 1 < p < oo, and h n ^ m = P/Qz m 
is an element of best L p (T) approximation to f in the class .M^ m . Then 
there exists a function if £ E g (G) with \\Q 2 S t m (p\\ q = 1, l/p + 1/q = 1, such 
that 

qHocwou - VmXo# = K,m\Q 2 m m mmi (6) 

almost everywhere on V if 1 < p < oo, and 

Q 2 (orv(6(/ - Vm)m = k/ - hn, m )(o\m (7) 

almost everywhere on T if p = 1. Moreover, for all 1 < p < oo we can assert 
that degQ = n. 

The occurrence of the constant 2 in the power of Q is a consequence of the 
fact that Q is constructed from free poles of the meromorphic approximants 
(compare with the factors in (6) and (7) that are connected with the "fixed 
part" of the element of best approximation). 

In connection with Theorem 1 we single out paper [7] , where an analog 
of (6) was obtained in the case when / is continuous on T and p = oo. 

This paper is organized as follows. The proof of Theorem 1 is presented in 
Section 2. In Section 3 questions concerning the orthogonal polynomials are 
investigated. Using equations (6) and (7) we prove that Q is the orthogonal 
polynomial with respect to a weight that varies with n. 
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2 Proof of Theorem 1 

We now show that in the case when 1 < p < oo the polynomial Q satisfies 
(6) on r. The case p = 1 can be treated in the same manner as 1 < p < oo. 
It will be assumed that A^ m > since for A^ m = the assertion is obvious. 
Fix an arbitrary u G E p (G). It is easy to see that we can represent u 
in the form u = aQ — (3P, where a £ E P (G) and (5 is a polynomial with 
deg (3 < n. For any real e, let 

where to is an arbitrary monic polynomial of degree d = n — deg Q with all 
its zeros in G (for d = we can take oj = 1). We remark that /i n ,m,£ belongs 
to the class Ai^ m . Assume that |e| is small enough so that Quo + e/3 ^ on 

r. 

We choose a function 4> G E q (G) with ||Q£ m <^|| 9 = 1, satisfying (4). Let 
us estimate 



Since 



we have 



h = j r Qiormif - h n , m M)dc 



i^n.m.F iln.rr, 



v n.m.e 



{Qu + ef3)Q£ m ' 



u = I Qiocm (no - kuo - lQu + ;;/ ( | Q(0e , ) « 

Next we claim that 

||/-^n,m,e||p = ||/- Vm|lp + °( £ ) as £ ^ 0. (10) 

Indeed, we have 

||/- V"i,e||p = ll/-Vm.llp + e ^ + °( e ) aS ^^O, (11) 

where 

A:=-(A^J 1 ^|j(/-^, m )(0r 2 Re((7^7^)(e)^|^) |dfl. 
Since, for any e, 

\\J ">n,m, e\\p — \\J h"n,m\\pi 
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from (11) we get that A = 0. Thus (10) holds, and consequently 

\Ie\ < ||/ - hnwWpWQCttlq = 11/ " KmWp + o{s) = K,m + o{s) (12) 

as e —> 0. 

Letting £^0we obtain from (9) and (12) that 

r Q(£M£) 

Since u is an arbitrary monic polynomial of degree d and u is an arbitrary 
function in E P (G), we can conclude from (13) that d = 0,degQ = n, and 
4> = Q<p, where ip G E q (G). In view of (4), this completes the proof of 
Theorem 1. • 

3 A Connection with the Orthogonal Polynomials 

3.1 G is the Unit Disk, 1 < p < oo 

Our goal in this section is to prove that Q is a nonhermitian orthogonal 
polynomial for a weight that varies with n (cf. [1] and [7]). 

Here and in the next subsections we assume that / is holomorphic on 
C \ E, where E is a compact set, E C G. Since / is holomorphic on T it 
follows from (6) and (7) that the function Q 2 £, m <p(f — h n ^ m ) can be continued 
analytically across V (see, for example, [9], [10]). Let D be an arbitrary 
domain such that E C D and D C G. We assume that the boundary 7 
of D is positively oriented with respect to D and consists of finitely many 
disjoint contours. 

We first consider the case when G is the unit disk with the center at 0. 
In this case the Smirnov class E p (G), 1 < p < 00, coincides with the Hardy 
class H p {G) of analytic functions on G. The Hardy class H P {G) consists of 
the functions ip such that 

sup / \(p(re l9 )\ p d6 < 00. 

0<r<l JO 

We say (p G H P {G) is an outer function if for all z G G 

- I'* ln\v(e M )\ j^^ 2 
2n Jo '^ v 7I \e ie -z 



ln|^)| = £/_ In |^)|^1^. 



A bounded nonvanishing analytic on G function cp is a singular function if 
I </?(£)| = 1 almost everywhere on T. 
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Each function ip in H p {G) has a factorization ip = BSM where B is 
a Blaschke product, S is a singular function, and M is an outer function. 
These factors are unique up to multiplicative constants of modulus one (see 
[3], [4]). 

We remark that since the function Q 2 £"V(./ ~ h n , m ) can be continued 
analytically across V, (p has no a nonconstant singular factor, and cp has a 
finite number of zeros in G. 

Let n > 1. Denote by B the Blaschke product 



*<*) = n i 



z Sfc,n 
/ i ' <Lk,n z 



constructed from the zeros of Q(z) = Y\( z ~ £fc,n)- Set 

fc=i 

n 
fc=l 

Denote by £ n the number of zeros of ip in G (counting multiplicity). For 
l n > 1 let i?i be the corresponding Blaschke product constructed from the 
zeros rji ;n , . . . , r\\ n<n of ip in G and let 

' n 

Un(z) = Y[(l-rik,n z )- 
k=l 

Set uj n (z) = 1 and B\(z) = 1 if l n = 0. We can now write 

(QV)(*) = B^B^z^iz), 
where ^ £ H2 q {G) and ^ is an outer function. Therefore, by (6) and (7), 

B 2 (0Bi(0C^ 2 (0(f - hn,m)(0 di = A^ m |^(6| 29 Kl (14) 

almost everywhere on T if 1 < p < oo, and 

B 2 (0Bi(0C^ 2 (0(f - h n ,m)(0dti = \(f - h n , m )(0m\ (15) 

almost everywhere on T if p = 1. 

We now show that for 1 < p < oo the polynomial Q satisfies certain 
orthogonality relations with respect to a varying complex-valued measure 
and we find an integral expression for / — h nm . 
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Theorem 2. Suppose that n > 1 and 1 < p < oo. T/ien t/ie function 
B£ > m i[) 2 ~ q (f — h„ jm ) can be extended analytically to C \ E. 

Moreover, the function ip*{z) = ip q (z),z £ G, can be extended analyti- 
cally to C\E~ 1 , where E^ 1 is reflection of E in the unit circle, and satisfies 
the equation 



2^i 7 ~i = A «^UJ? Z£C \ D - < 16 ) 

Furthermore, 

t Q(0 277\ — 77\ = f° r v = 0,l,...,n + l n -l (17) 

and, for z G C \ D, 

(BTr^ 2 -Vw n u; n )(z)(f - h n , m )(z) = -L / (g ^ 2 "'f )(0 ,f , (18) 

27TZ J 7 U>n(£ Vn(£)U ~ £) 

where T is any polynomial of degree at most n + l n . 
Proof. Let $ = 5fV(/ - &n,m)- By (14), 

$(0/^(6 = 5(0r^- 9 (6(/ " ftn,m)(0 

=^»i(^^=^5(ok(o^G)^ (19) 

almost everywhere on V. Taking into account now that <& can be continued 
analytically across T, ip q is an outer function, and the fact that the boundary 
values of 3> /ip q on T belong to L2(T), we can conclude that &/ip q £ H2(U r ) 
for some r > 0, where U r = {z : r < \z\ < 1}. Thus, by (19), the functions 
B£ m ^ 2 ~ q (f - h n ,m) and B£ m ijj 2 ~ q (f — h n ,m)/w n uj n can be extended ana- 
lytically to C \ E and the function V*(£) can be extended analytically to 
C\E~ 1 . Moreover, 



Bti m ip 2 - q (f-h n , m ) Q 



w n ui n 
Consequently, 



1 



z n+l„+l 



as z — > oo. (20) 



?Q9 27^ — 7F\ — = for v = 0,l,...,n + ln-l. 

7 KlO^nlO 
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Let T be any polynomial of degree at most n + l n . By the Cauchy 
formula, 

(BT£ if; q /w n oj n )(z){f -h n ,m){z) - 



2m J 1 w n (£)uj n (£)(z - £) 
zeC\D. 

In particular, 

zeC\S. (21) 

Then, by (19), we get 

3.2 The Case when G is the Unit Disk and p = 1 

We now consider the case when p = 1. Let 

^) = ex P ^- j / r ^— log ^- ideij, n<i. 

It is not hard to see that v £ H2(G), v is an outer function, ||f H2 = 1, and 

l(/-fcn,m)(OI=Ai >m K£)| 2 

almost everywhere on T. So, by (15), 

B 2 (o*i(orv 2 (0(/ - v»»)m = Ai )TO |t;(oi 2 idei 

almost everywhere on T. 

Applying the same arguments as in Subsection 3.1, we obtain the fol- 
lowing theorem. 

Theorem 3. Suppose that n > 1 and p = 1. Then the function 
B£ m il) 2 (f — h ntm )/v can be extended analytically to C \ E. 

Moreover, the function v*(z) = v(z),z £ G, can be extended analytically 
to C \ E~ l , where E^ 1 is reflection of E in the unit circle, and satisfies the 
equation 
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Furthermore, 

Z Q(Z)-27£\ — 7tw^ = ^ or " = 0,l,...,n + Z„-l (23) 

and, for z G C \ D, 
(BTr^/w n co n v)(z)(f - h n , m )(z) = -L / ( ff^ 2 ^! )dg , v (24) 

27TZ 7 7 W n (£)c<; n (£)v(£)(2: - ?) 

where T is any polynomial of degree at most n + l n . 

3.3 G is a Multiply Connected Domain 

We consider the case when 1 < p < oo and G is a bounded domain with the 
boundary V consisting of N disjoint closed analytic Jordan curves 1^ , k = 
0, . . . , N — 1. It will be assumed without loss of generality that N > 2. 
Assume that G lies interior to To- Let a^ be an arbitrary point lying inside 
ofr fc , k = l,2,...,N-l. 

We represent d£ in the form d£ = S(£)K|, £ G I\ Let $i = Q 2 £ m ip(f - 
h n>m )S. By (6), $i > on T, and 

*i(0/(qs)(o = A^ m QioirnQ(0i 29 - 2 i^)iV5(e) 

almost everywhere on T. We assume that &i/QS = Q£, m tp(f — h n;m ) / 
on r. Let x = hid r ($i(£)/(Q<S') (£))■ Jt is eas y to see tnat X = N - 2 - n. 
Let n > AT, then x < -2. 

Since % < —2, it follows from the results of the theory of boundary value 
problems for analytic functions (see, for example, [2]), that there exists a 
function g holomorphic on G and continuous on G, 

N-l 

g(z) = H (z - o fc ) exp(-tt(*)), z£G, 
k=i 

where 

JV-1 



iog(r x U(t- a k)®i(0/(QS)(o) 



*(*) = ^ / — — *^— * «, 

27rz Jr £ — z 

such that the function (gQ£, m p)(z)(f—h ntm )(z) can be extended analytically 
to C \ G, and 



(gQz m <p)(z)(f - hn, m )(z) = O 



1 



as z — > oo. 
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Consequently, Q satisfies the following orthogonality relations 

j r Z"(9QC<PfmdZ = 0, v = 0, . . . , |x| - 2. (25) 

Denote by T any polynomial of degree at most \\\ — 1- By the Cauchy 
formula, 

(TgQz tp){z)(f - h ntm )(z) = —I , z£C\G. 

ZTTl Jr z — 4 



Remark. The orthogonality equations (25) must be modified accordingly, 
if we drop the constraint that Q(, m (f(f — h n ^ m ) / on T. 
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Abstract 

Consider the persistence and the global asymptotic stability of models governed 
by the following pure-delay type nonautonomous Lotka-Volterra differential system: 



dxi(t) 
dt 



x l {t){c l {t)-Y,Y, a \^ t ) x ^ t - T ^^ t ^ t o, l<»<n, 



3=1 1=0 

Xi(t) = 4>i{t) > 0, t < to, and <fii(to) > 0, 1 < i < n, 

where each (j>i(t) is a continuous function for t < to, each Cj(t) and a\At) are bounded 
continuous functions on [to,+oo) and 

m 

inf Q2 a l u (t)) > 0, o|j-(t) > 0, i < j < n, 1 < i < n, and n > 0, < I < m. 

*- to 1=0 

In this paper, for the above system of pure-delay type, by improving the former 
work (Communications on Applied Nonlinear Analysis 9, 31-45) which extended the 
averaged condition offered by S. Ahmad and A. C. Lazer (2000, Nonlinear Analysis 
40, 37-49), we offer conditions of persistence which are independent of time delays, 
and considering a continuous Lyapunov-like (nonnegative and non-differentiable) 
function to the above pure-delay type system, we establish sufficient conditions of 
global asymptotic stability. 

AMS Subject Classifications : 34K20, 34K60 

Key words : persistence ; global asymptotic stability ; pure-delay type nonautonomous Lotka- 
Volterra differential system. 

1 Introduction 

Consider conditions of the persistence and global asymptotic stability of the following 
pure-delay type nonautonomous Lotka-Volterra delay differential system. 

dx-(t) n m 

al 3 = 11=0 V-V 

Xi{t) = 4>i{t) > 0, t < to, and 4>i(to) > 0, 1 < i < n, 
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where each <pi{i) is a continuous function for t < to, each q(£) and a\At) are bounded 
continuous functions on [£ ,+oo) and 

m 

inf (Va|;(i)) > 0, aj.-(t) > 0, i < j < n, 1 < i < n, r t > 0, < I < m. (2) 

Recently, in [11], Xu and Chen have established conditions for the permanence of sys- 
tem and using a Lyapunov functional, they obtained conditions for the global asymptotic 
stability of the time-dependent pure-delay-type Lotka-Volterra predator-prey model of 
three species. 

In this paper, for the above system of pure-delay type, by applying the former work 
of Muroya [7] which extended the averaged condition offered by Ahmad and Lazer [2], 
we offer conditions of persistence which are independent of time delays, and considering 
a continuous Lyapunov-like (nonnegative and non-differentiable) function to the above 
pure-delay type system, we establish sufficient conditions of global asymptotic stability 
which extend the results of Zhen and Ma [12] and are a different type from those of Xu 
and Chen [11]. 

Let 

m 

a i(t) =E ( 4( f )' s «(*) - °> s 'j(*) = °ij(*)> ^h < / < m, 

1=0 

m m 

a iL = w£Q2a\i(t)), a iM = sup(^a- i (t)), c iL = inf a(t), c iM = supq(i), 



*>*o ;=0 t>t l=0 " ' *>*o t>t 

a l ij(t) = min (0,a^(t)), a-+(t) = max(0, a^(t)), 

m m 

Kjl = inf (J2 ojj (*)), and t>tjM = sMY. s 3"(*))> 1 < h J < n, W 

- ° 1=0 *^*0 l =0 

1 f h 

m[ci] = lim inf{ / Ci(s)ds \ t < ti < t 2 and t 2 — t\ > tj, 

*-*°° t 2 — t\ Jti 

and 

1 f h 
M[ci\ = lim sup{ / Ci{s)ds \ to < t\ < t 2 and t 2 — t\ > t} , 1 < i < n. 

*^oo t 2 — ti y*i 

Note that b~ iL = bf iM = and en < to[cj] < M[q] < qm, 1 < i < n. 

First, applying the results of Muroya [7] to this system, we improve sufficient conditions 
for the persistence of system. 

Then, the newly extended averaged condition is as follows: 
For any Xi > 0, 1 < % < n such that 

n 

M[ci] > d iL Xi + ^ K jL Xj, l<i<n, (4) 

i=i 

it holds that 

m[ Ci \ > J2btjM%j, l<i<n. (5) 
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Let 

f A L = diag(aiL, a 2 L, ••• , a nL ), B L = \b~ jL ], B^ = \bf jM ], are n x n matrices, 
1 and c = [m[ci\] and c = [M[ci\] are n— dimensional vectors. 



(6) 



Then, the condition (4)-(5) is equivalent to the following: 
For any n-dimensional vector x = [xi\ > such that 

c > (A L + B L )x, (7) 

it holds that 

c > 5+ x. (8) 

Assume that 



(9) 



J A L + B L is an M-matrix, (A L + B L y x c > 
\ and c > Blj(A L + BlY l c. 

Then, (Al + -B^) -1 ^ and Eq.(7) implies that 

(A L + B L )- l c>x, 
and from c > B^(Al + -B^) _1 c and 5^ > 0, we have that 

c > 5+ (A L + sr) _1 c > £+ x, 

which implies Eq.(8). 

Thus, the newly extended averaged condition (4)- (5) is satisfied. 



Put 

f = max Ti, fi = max{ri | a l u ^ 0, < / < m}, 1 < i < n, 

0<l<m 
Xi = f^, Xi = Xiexp(ciMTi), 

5j = (qm - Yl K jL Xj)/aiL, %i = Xi exp{(c iM - X! K jL %j) f i}, 2 <i <n-l, (10) 
i=i i=i 

ra— 1 n— 1 

^n = (CiiM — 2^ ^njL^j)/dnL) X n = X n exp\{C n M — 2_^ U n j L Xj)f n \, 
3=1 3=1 

and assume 



Qm - J2 b ijL x j > 0, 1 < i < n. (11) 

3=1 

Then, 

Xj > £j > 0, 1 < i < n. 

Note that if A^ + -B^ is an M-matrix and (A^ + B^)^ l c > 0, then for n-dimensional 
vector x = [x^ and cm = [qm], we have that 

x > (A L + B^^CAf > (A L + Bl)- X c > 0. (12) 
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Thus, Eq.(9) implies Eqs.(ll) and (12). 

For any two solutions {xi(t)}™ =1 and {yi(t)}™ =1 of the system (l)-(2), we have the fol- 
lowing equation which will be useful to obtain sufficient conditions for the global asymp- 
totic stability of system (see Lemma 8): 

n m 

f t {hi(xi(t)/ yi (t))} = -ai(t)(xi(t) - yi (t)) - EXX-(*)(*;(* - n) - Vj(t - n)) 

j=i 1=0 

m r f n m 

+ E 4(*) / [{*(«) -J2J2 <■(*)%■(* - Tk)}(Xi(s) - yi (s)) 
1=0 ' n j=l k=0 

n m 

j=l k=0 

t >t Q + f , 1 < i < n. 

(13) 
Put an n x n matrix as 



A = [oij], 



where 



t>t 

m n n 

an = a iL - (J2 a\ iM Ti){a iM Xi + max(-q L + (a iM Xi + J2 KjM x j)> c iM - E Kjl x j)}, 

1=0 j=l j=l 



Cl.-ij 



-hjL + KjmH 1 + (%2a\iMn)xi}, j^i, l<i<n. 



1=0 



(14) 
We shall establish the following extension of the Ahmad and Lazer's results in [2] and 

Muroya [7] to the system (l)-(2). 

Theorem 1. (Cf. Ahmad and Lazer [2] and Muroya [7]) 

For the system (l)-(2), assume Eq.(9) and suppose that there exists a nonempty subset 

Q E {1, 2, • • • , n} such that 

CiL ~ E b tjM X 3 > °) f or an V i e Q- ( 15 ) 

Then, the system is persistent for solutions, that is, 

< inf Xi(t) < sup Xi(t) < +oo, 1 < i < n. (16) 

*>*(> t>to 

Moreover, if 

A is an M — matrix, (17) 

then for any two solutions Xi(t) and yi{t), 1 < i < n, it holds 

lim ( Xi (t) - ydt)) =0, 1 < i < n. (18) 

t— »oo 
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In particular, if there exists a positive equilibrium x* = (x]_,X2, ■ ■ ■ , x*) of the system 
(l)-(2) ; then in Eq.(lA), we can take an, 1 < i < n as 



an = a iL - C^a l iiM Ti)ai M Xi, 1 < i < n. 
1=0 



Note that if 

Al — {—B~[ + B~m) is an M — matrix, 

m 

then Eq.(17) holds for sufficiently small delays such that J^ a l iiM Ti, 1 < i < n are suffi- 

1=0 

ciently small. 

The organization of this paper is as follows. In Section 2, using the same techniques 
in Ahmad and Lazer [2] and Muroya [7], we prove that Eqs.(9) and (15) =^> Eq.(16), and 
Eqs.(16) and (17) => Eq.(18). 

2 Conditions of persistence and global asymptotic 
stability 

Consider the persistence and the global asymptotic stability of models governed by a 
pure-delay-type nonautonomous Lotka-Volterra delay differential system (l)-(2). 

We have a lemma. 

Lemma 1. For the system (l)-(2) and 1 < % < n, 

„l n m 

Xi(t) = Xi(t ) exp( / {a(s) - Y^ Yl al ij( s ) x j( s ~ T i)}ds), t > t , (19) 

Jt ° j=l 1=0 

and every solutions Xi(t), 1 < % < n, exist and remain positive for all t > to. 
Proof. From Eq.(l), we have for any t >to, 



d 1 /"' n m 

^7{^7Y ex p(/ { c »( s )-EE a i( s h( s - r ')}*)} = - 

at Xi[l) J to j = i £_q 

Thus, integrating both sides with respect to t on [to,t], one obtains Eq.(19), from which 
we get the conclusion. □ 

We have the following lemma which is an extension of Theorem 2.1 in Xu and Chen 
[11] for a time-dependent pure- delay-type Lotka-Volterra-prey model of three species. 
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Lemma 2. For Eqs.(3) and (10), assume Eq. (11). 
Then, any solutions Xi(t), 1 < i < n of system (l)-(2), are bounded above and 

limsupXj(i) < Xi, 1 < i < n. 



Proof. Let D denote the lower left Dini-derivative. If for some t > t , D Xi(t) > 0, 
then there exists a nonnegative integer l u such that < l± t < m and 

x 1 {t-n»)<$£ = 2i- 

Because, if 

mm xi(t — Ti) > - — , 

0<l<m a\L 



then 



c i(0 - mi a ij(t)xj(t - n) < cim - a 1L ( min xi(t - n)) < 0, 



which implies D Xi(t) < 0, by Eq.(l). 

Therefore, by Eq.(19), xi(t) < Xi(t - T[ u ) exp(ci M Tr u ) < x 1 . 

Thus, if x\(t) > x\ for some t > to, then we have 

D- Xl (t) < 0. 

Now, let us consider the case that x±(t) is eventually decreasing and bounded below by 

x±. Then, lim Xi(t) exists. Set (5 = lim Xi(t) — X\ > 0. 

We will show that (5 — 0. 

Indeed, suppose (3 > 0. Let take any positive constant rj. Then, there exists to > to such 

that 

(5 < X\(s) — X\ < (5 + rj, for s > t , 

since x\(t) — x\ eventually decreases to (3. Thus, we have 

n m 

D~xi{t) < xi(t){ciAf-5^2 a ii(*) a; i(*~ r 0} 

< xi(t)(-a 1L (3), for t>t L = t + f. 

Therefore, we have 

xi(t) < Xi(ii) exp{— P I a 1L ds} : 

which in turn implies, due to / a^ds = +oo, lim xAt) = 0. 

This contradicts xAt) > x\ + (3 > 0. Thus, lim Xi(t) = x±. 
Hence, we have 

limsupa;i(t) < x±. 

£— >oo 

Then, for any fixed positive constant e, there exists a constant t i > t = t such that 
Xi(t) < Xi, for any t > t\ — f . 
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Next, for some 2 < i < n, suppose inductively that for any fixed positive constant e, there 
exists a constant tj_i > t{-i such that 

Xj(t) < Xj + e, for any t > ti-i — f, 1 < j '< % — 1. 

If for some t > ii, D~Xi(t) > 0, then there exists a nonnegative integer l_ it such that 
< in < m and 

i-l _ i-l _ 

Xi(t - T ht ) < {c iM - J2 Kjl( x j + e)}Mi < x i + {(- Y, Kjl)/^}^- 

3=1 3=1 

Because, if 



n 

min Xi{t-ri)> (c m - Y Kjl( x j + e))/o iL , 

0<l<m ' J 
J=l 

then 

n m i— 1 

Ci(*) - S S a'jC*) 3 ^* ^ r ') ^ c ^ ~ S ^x(% + e) - a i^( n 5H? M* - r 0) < o, 
i=i 1=0 j=i °-'- m 

which implies D~Xi(t) < 0, by Eq.(l). 
Therefore, by Eq.(19), 

i-l _ 

Xi(t) < Xiit - T ht ) exp{(c iAf - Y, Kjl( x j + e ))nJ 

3=1 

i-l _ i-l _ 

< x\ = [xi + {(- Y KjL)/aiM}eexp{(ciM - Y Kjl x j) t iJ] 

3=1 3=1 

i-l _ 

xexp{(-^6- L )er;.J. 

i=i 

Thus, if there exists a constant U > U-\ such that Xi(t) > x\ for some t >U, then 

D'Xiit) < 0. 

If Xj(t) is eventually decreasing and bounded below by x\. Then, as similar to the above 

discussions of i — 1, we see lim xAt) = x]. 

Since e > is any positive constant, we have that by inductions of % — 1, 2, • • • , n, 

limsupxj(t) < Xi, 1 < i < n. 

This completes the proof. □ 

The following lemma is an improved result to Lemma 2.2 in Ahmad and Lazer [2]. 

Lemma 3. For Eqs.(3) and (10), assume Eqs. (11) and (15). Then, for solutions 
Xiit), 1 < i < n of system (l)-(2), 

liminfV xAt) > 0. (20) 

ieQ 
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Proof. By assumptions to the system (l)-(2), there exist positive constants 7, bi, < 
I < m such that for i e Q, 

c iL ~ J2 btjM%j > 7» and a iM , a\ jM <k,jeQ,0<l<m. 
By Eq.(l), it follows that for i GQ, 

m m 

x\{t) > Xi(t){d(t) - J212 n ijMXj(t - n) - ai(t)xi(t) - 22ojj(t)a;j(t - n)} 

jgQ 1=0 jeQ 1=0 

m 
1=0 jeQ 

This shows that if 

f(0 = E *;(*), 

then 

m 

V'(t)>V(t){y- , £biV(t-7i)}. (21) 

Now, suppose that liminf V(t) = 0. Then, there exists a sequence {t p }21 1 such that 



i^oo 



V'(t p ) < 0, and lim V(t p ) = 0. 



p— +00 



Since V(t) > and for V* = 7/ (J] 6,) > 0, 



z=o 



^(t)>\/(t)E^(^-nt-r,)), 



«=o 



it holds that for each p > 1, there exists a l p E {0, 1, 2, • • • , m} such that 

V(t p -r lp )>V*. 
Similar to Eq.(19), it follows from Eq.(21) that 



V{t p ) >V(t p - Tl 



rt m 

P )exp(/ P ^-^Wis-r^ds). 
Jt p T h 1=0 



By Lemma 2 and assumptions, there is a positive constant V such that for V(t) <V,t> 

to and for f = max 17, we have that 
u 0<l<m ' 

m 

V(t p ) >P = V* exp({7 - £ 6iV-}f ) > 0, p > 1, 
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which is a contradiction. 
Therefore, 



liminfV(t) >0, 



t^oo 



and hence, Eq.(20) holds. □ 

From Lemma 3, we easily obtain the following Lemmas 4-6 (see Lemmas 2.3-2.5 in 
Ahmad and Lazer [2]). 

Lemma 4. For Eqs. (3) and (10), assume Eqs. (11) and (15) and suppose that Eq. (16) 
does not hold. Then for solutions Xi(t), 1 < % < n of system (l)-(2), there exists a maximal 
nonempty subset J of {1, 2, • • • , n} such that 



and 



J^{l,2,-..,n} 



inf max{ Xj(t) \ j G J} = 0. 



Lemma 5. For Eqs. (3) and (10), assume Eqs. (11) and (15) and suppose that Eq. (16) 
does no£ /io/d. Lei :rj(t), 1 < i < n and J &e as in Lemma 4 and ptrf 

min{ Xj(to) \ j G J} = 5 > 0. 

Then, there exist sequences {s p }^L 1 and {t p }™ 1 such that for any p > 1, 

t0 ^ S p < ^P! 

* P - s p > p, (22) 

max{ Xj(t) | j G J, s p <t < t p } = 5/p, 

and there exists j p G J such that 

x j p ( s p) = s /p- ( 23 ) 



The following lemma is a little improved version of Lemma 2.5 in Ahmad and Lazer 
[2] (see Lemma 2.6 in Muroya [7]). 

Lemma 6. For Eqs. (3) and (10), assume Eqs. (11) and (15) and suppose that Eq. (16) 
does not hold. Letxiit), 1 < i < n, J, and the sequences {s p } p x L 1 , {t p \ p X} =l be as in Lemma 
5 and let K be the subset of {1, 2, • • • ,n} such that J fl K = <fi and JU K = {1, 2, • • • ,n}. 
Then, there exists a number e > such that for all p > 1 and all k G K , 

Xk{t) > e, for any t G [s p ,t p ]. 
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Continuing the proof that Eqs.(9) and (15) =>- Eq.(16), we note that if Eq.(16) does 
not hold and Xi(t), 1 < % <n, J, and j p are as in Lemma 5, there exists an integer j* G J 
such that j p = j* for infinitely many integers p. Let {p q } q x L 1 be an increasing sequence of 
integers such that 

3 Pq = j*, for any q > 1. 

To simplify the notation, let c q = s Pq and d q = t Pq for q > 1, so 

d q -c q > p q , for any q>\. 
Since, according to Eq.(22), 

max{ Xj(t) | j G J, c q <t < d q } < S/p q , 
and for < / < m, 

1 f d q 1 f d q 1 f c Q 1 r d q 

/ Xj(t — Ti)dt — / Xj(t)dt+- / Xj(t)dt—- / Xj(t)dt, 

aq Cq J Cq ttq Cq J c q ttq Cq J c q — T\ aq Cq J dq—ri 



we have that for any j G J, 



f " Xj(t - n)dt = 0, < I < m. 

Cq J c q 



lim 

9^°° d q — C, 

Since, according to Eqs.(22) and (23), Xj t (c q ) = S/p q > Xj :f (d q ), we have that for any 

<Z>1, 

ln(^(d g )/^(c,))<0. (24) 

The following lemma is an improved version of Lemma 2.6 in Ahmad and Lazer [2] 
(see Lemma 2.7 in Muroya [7]). 

Lemma 7. For Eqs.(3) and (10), assume Eqs. (11) and (15) and suppose that Eq. (16) 
does not hold. Let J and K be as in Lemmas 5 and 6. Then there exists for each k G K , 
a number Xk > such that for any k G K , 

M[c k ) > d kL x k + J2 hjLXj, (25) 

jeK 

and there exists a j* £ J such that 

m[c jt ] < J2 j*kM%k- (26) 

kaK 



It is now easy to finish the proof that Eqs. (9) and (15) =^> Eq.(16) in Theorem 1 by 
contradiction. Suppose that conditions (9) and (15) are true but Eq.(16) does not hold. 
Then, Eq.(9) implies Eq.(ll) and by the above lemmas, there exist two subsets J and K 
of {1,2, • • • ,n}, an integer j* G J and positive numbers x k , k G K such that Eqs. (25) 
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and (26) hold. 

Let Xj = for any j G J and x = (xi, X2, ■ ■ ■ , x n ) T . From Eqs.(25) and (9), we see that 

n 

M[c k ] > a kL x k + Y hjLXj > a kL x k + Y^kjL x j^ f or an V keK, 

n 

M[a] > > a iL Xi + Y Kjl x j, f or an V i e J- 

3=1 

Then, by Eqs.(l) and (9), it holds that 

x < (A L + B L y l c. (27) 

It follows from Eq.(26) that 

m[cj.\ < Y hkMX k < Y KkM X k- 

keK keK 

However, by conditions (9) and (27), we see that 

c > 5+ (A L + B L y l c > B+x. 
Therefore, by Eq.(6), it holds that 

M C i»] > S btkM x k = Y KkM^k- 
k^j„ keK 

This contradiction proves that Eqs.(9) and (15) => Eq.(16). □ 

The following lemma is a basic result to obtain a sufficient conditions of the global 
asymptotic stability of system (l)-(2). 

Lemma 8. For any two solutions {xi(t)}™ =1 and {yi(t)}2=i of the system (l)-(2), it 
holds Eq.(13). 

Proof. We have that 
(xi(t) - Xiit - Ti)) - (yiit) - y, t (t - n)) 

= / x'^sjds- / y'i(s)ds 

/I n m 

{ci(s) - Y, Yl a ^( s )%'( s - T k )}(xi(s) - yi{s))ds 

- T i j=lk=0 

/I n m 
Y Y2 X i( S ) a ij( S ">( X 3i S - T k) ~ Vj( S - T k))ds, 
--n 3=1 k=o 

t > to + n, 1 <i <n, < I < m. 

On the other hand, by Eq.(l), we have that 

X M-f§ ) --it-<3(t)Mt-r l )-y 3 (t-r l )) 
•M'v yiyi*) 3=11=0 

m 

- I>ii(t)[(zi(t) - &(*)) - iMt) - Xi(t - n)) - ( yi (t) - Vl {t - r,))}], 

1=0 
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from which we obtain Eq.(13). □ 

Now, consider the proof that Eqs.(9), (15) and (17) =>- Eq.(18) in Theorem 1. The 
proof relies on ideas already used by Gopalsamy [4], Tineo and Alvalez [10], Redheffer [8], 
Ahmad and Lazer [1], [2] and Muroya [7] (see also Teng and Chen [9]). 

Lemma 9. In addition to the conditions (9), (15) and (17), suppose that in the 
system (13)-(14), there exist positive constants a±, a 2 , ■ ■ ■ , ct n and rj > such that 

otian - Y^ a jUji >V, I <i <n. (28) 

Then any two solutions Xi(t), yi(t), 1 < i < n of system (l)-(2), satisfy the conditions 

lim (xi(t) — Vi(t)) = 0, 1 < i < n. 

Proof. For the positive constants «i, a 2 , ■ • • , a n in Eq.(28), consider a continuous 
Lyapunov-like (non- negative and non-differentiable) function v(t) such that for t > to, 

n m „i 



It, 11/ lit, ™ + 

v(t) ='£ i a i [\ln(x i (t)/y i (t))\ + EE / K( s + r /)IM s ) - Vj( s )\ ds 
i=l j=n=o Jt - T ' 

m ft rt n m 

+ 12 a ^( s + r ') / l Q ( M ) ~ 1212 a J( M )%'( M - T k )\\xi(u) - yi(u)\duds 

1=0 Jt - T i Js j=l k=0 

m ft ft n rn 

+ 12 / a ii( s + T i) / 1212 \ x *( u ) a ij( u )\\ x j( u - T k) - Vj( u - r k )\duds 

1=0 Jt ~ n Js j=l k=0 

m n m ,. t rs+r k 

+ 121212 (/ a l ii(u + Ti)du)\x i (s + T k )a k ij (s + Tk)\\x j (s)-y j (s)\ds]. 

l=0j=lk=0 Jt - Tk Js - T l+r k 

One can verify that 

x(t)a(x)(t) = \x(t)\ and D + \x(t)\ = a(x)(t)^^-, 



where 

I I 9 f r/f I S (1' nr nrif) — PI nnrl - 

dt 



if x{t) > 0; or x{i) = and -^ > 0, 



a(x)(t) = { 0, if x{t) = and ^§ = 0, 

-1, if x(t) < 0; or x(t) = and ^ < 

and D + denotes the upper right Dini-derivative. Since the signs of ln(xi(t)/yi(t)) and 
Xi(t) —yi(t) are same, it follows from Eq.(13) and the above that for t > t and 1 < % < n, 



D+\ ]n(xi(t)/ yi (t))\ < -ai(t)\ Xi (t) - Vi {t)\ +1212 I^WIM* " n) ~ Vj(t ~ n)\ 

j=l 1=0 

m „i n m 

+XX(*) / lfe( s )-EE a 5(%( s - r ^H^)-i/^)) 

1=0 Jt Tl 3=1 k=0 



3~- 
n in 

12 12 x i( s ) a U s )( x j( s - Tk ) - y^ s - r k))\ds. 

j=l k=0 

(29) 
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By Eqs.(9) and (15) => Eq.(16) in Theorem 1, Xi(t) and yi(t), 1 < % < n are bounded 
above and below by positive constants for t > to- Therefore, it follows that for any 
t > to + max n, v{t) < +oo. 

0<l<m 

Calculating the upper right Dini-derivative of v and using Lemma 2, Eqs.(28), (29) and 
(14), we obtain that there exists a sufficiently small e > such that < rj/2 < n e and for 
a sufficiently large t > t , Xi(t), yi(t) < Xi + e, 1 < i < n and 

D+v(t) 

n m ,.i n m 

<-J2<x i [{a i (t)-'£( aUs + TjWI^-^E <■(*)%■(* -Tfc)l}W*)"%(*)l 

i=l Z=0 t_T! j=l fc=0 

n m m rt+Ti. 

E E{l»ii(* + r ')l + E( / ' a U s + n)ds)\xi(t + 7*)a£-(t + r,)|}|^(t) - yj (t)\] 



3=1 1=0 



fc=0 



< -E( a ^ - E a i a yi x i(*) ~ v^)\ 



i=l 

n 



i¥=i 



<-5EW*)-ft(*)l. 



where 



i=l 






x - (E a iiM T i){a iM {xi + e) 



1=0 



n n 

+ max(-c iL + (a iM (xi + e) + E K( x j + e ))' c ^ ~ E Kj( x j + e ))}' 

3=1 ' 3=1 



d% 



-hjL + bt JM ){i + (E al uMn)(xi + e)}, j ± i, l<i< 



n. 



Ve = mm {anali - E «i«ji) > °- 



KKn 



Hence, the remained part of proof is similar to the proof in Gopalsamy [4] and Ahmad 
and Lazer [2]. Thus, we get the conclusion. □ 

We prove that the condition (17) implies Eq.(28) (see Lemma 2.9 in Muroya [7] and 
cf. Lemma 3.2 in Ahmad and Lazer [2]). 

Lemma 10. (See Berman and Plemmons [3]). The condition (17) holds, if and 
only if, there exist constants ctj > 0, 1 < i < n such that 



ctian — V^ ajOji > 0, 1 < % < n. 



(30) 



Proof. Let A = A = [dij], D = diag(6n, 0,22, • • • , d nn ) and B — I — D~ X A be n x n 
matrices, where da > 0, 1 < i < n and I is the n x n unit matrix. Then, it is well known 
(see for example, Berman and Plemmons [3]) that A is an M-matrix, if and only if, 



an > 0, 1 < % < n, B > and p(B) < 1, 
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where p(B) is the spectral radius of matrix B. 
Then, 

j A = D(I- B), A T = (/ - B T )D = D(I - D- l B T D), T)- l B T D > 
\ and p(D- l B T D) = p(B T ) = p(B) < 1, 

which implies that A T is also an M-matrix. 

It is also well known ( see Berman and Plemmons [3]) that A T is an M-matrix, if and 

only if, there is a diagonal matrix D = diag(ai, a 2 ,- ■ • , «„) such that 

Oi% > 0, 1 < i < n, and A D is a diagonally dominant matrix, 

which implies Eq.(30). 

Thus, we get the conclusion. □ 

By Lemmas 9 and 10, we complete the proof that Eqs.(9), (16) and (17) => Eq.(18) 
in Theorem 1. 

Now, consider the following Lotka-Volterra type competitive system with discrete 
delays (see Zhen and Ma [12]): 

f x '(t) = x(t){d - a n x(t - Tn) - a 12 y(t - t 12 )}, ,„i) 

\ y'(t) = y{t){c 2 - a 21 x(t - r 21 ) - a 22 y(t - r 22 )}, 

with initial conditions 

x{t) = Mt)>0, te [-r,0], 0i(O)>O, 

y(t) = <h(t)>Q, *e[-r,o], 2 (o) >o, 

r = maxJTjj}, 

where x(t),y(t) stand for densities of both the population at time t, respectively, Ci,aij 
are all positive constants, and Tij are nonnegative. 

To this system, in [12], Zhen and Ma have obtained the following result. 

Theorem A. (Zhen and Ma [12]). 
Suppose that 



and 
where 



a _^ > c A > a Jl ) (32 ) 

Q-21 c 2 a 22 



auTuMi < 1, z = 1,2, and A U A 22 - A 12 A 21 > 0, (33) 



A u = on(l - onTiiMi), A 12 = -oi 2 (l + 011T11M1) 

A 2l = -0 2 l(l + a227"22^2), ^22 = 022(1 - a 2 2T22^2) 



M x = (^)e cmi and M 2 = (^)e C2T22 . 

1 v an y z v a22 y 



Then the positive equilibrium (x*,y*) of system (31) is globally asymptotically stable. 
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To the system (31), by (32) implies (15) for Q = {1, 2} and the existence of the positive 
equilibrium (x*,y*) of system (31). Then, by the particular case in Theorem 1, we have 
that 

a u = An, dij = Aij, J f 1 1, % = 1, 2, (."4) 

and (17) is equivalent to (33). Hence, by Theorem 1, we obtain the same conclusion of 
Theorem A. Thus, Theorem 1 is an extension of Theorem A to nonautonomous cases of 
several species. 



Similarly, we consider the following time-dependent pure-delay-type Lotka-Volterra 
predator-prey model (see Xu and Chen [11]): 

' dxAt) 
— — — = xi(t)(ci(t) - au(t)xi(t - Tn) - a 12 (t)x 2 (t - t 12 )), 

dxo(t) 

— — = x 2 (t)(-c 2 (t) + a 2 i(t)xi(t - T 21 ) - a 22 (t)x 2 (t - t 22 ) - a 23 (t)x 3 (t - r 23 )), (35) 

dx (t\ 

1 = x 3 (t)(-cn(t) + a 32 (t)x 2 (t - r 32 ) - a 33 (t)x 3 (t - r 33 )), 
at 

where Cj(t) and Q>ij(t), 1 < i,j < 3 are continuous, bounded, and strictly positive func- 
tions on [0, +oo), and r„, 1 < i, j < 3 are nonnegative constants. 



To this system, in [11], Xu and Chen have obtained the following result. 



Theorem B. 

Assume 



(see Xu and Chen [11]). 



a 32M {a 2 iMCiM exp(rnCiAf) - c 2L a nL ) exp{( a21 a ^ 1M exp(rnCi M ) - c 2L )t 22 \ 

-ailL0 22 LC 3 L > 0, 
a 32L( a 21LXl — C 2 M — a 23MX 3 )/a22M 

x exp{(a 2 i L Xi - c 2M - a 22M ^2 - 023^3)^2} - c 3M > 0, 



where 



X\ 

x 3 



<-lM 



X 



X 2 
i^3 



nu , , , , , , „ , _ a2iA Z] C2L exp{(a 2 i M a:i - c 2L )r 22 }, 
a32M a x ;~ CiL exp{(o 32M x 2 - c 3L )r 33 }, 

ClL-ai2M^2 

a llM 
a 21L^i-C2Af- Q23M^3 

a 22M 
a 32L^2~ c 3M 

a 33M 



exp(rnCiM), x 2 
exp{(a 32M 
exp{(c iL - a 11M x 1 - a 12M x 2 )Tu}, 

exp{(a 2 i L ^i - c 2M - a 22M x 2 - a 23M x 3 )T 22 ], 



■>'■>. = ~^== — — exp{(a 32L ^ 2 - c 3M - a 3 3M^3)r 3 3}, 
and suppose further that the system (35) satisfies 

A t >0, i = 1,2,3, 



where 



UiiL — a iiLX_i, t>ijM — a ijAlXi, I — 1, •£, O, 

A\ = 2bm — b\ 2 u — b 2 iM — &iim(26hm + &i2m)Tu — &21M&22MT22, 
A 2 = 2b 22 L — b\ 2 M — b 2 \M — b 23 M — b 32 M — buMbuMTn 

— b 2 2M{b 2 lM + 26 22 m + b 23 u)T~22 — &32M&33MT33? 
A 3 = 2b 33 L — b 23 M — b 32 M — b 22 ub 23 M T 22 — ^33m(&32M + 26 33 m) t 33- 



(36) 



(37) 



280 Y.MUROYA 



Then any positive solution of system (35) is globally asymptotically stable. 

To the system (35), we apply Theorem 1, then we obtain the following result. 
Corollary 1. Assume (36) and suppose that 

J 611,622, 633 > 0, and 



, -. - . , . , = = (3* 

O12O21O33 + 023 a 320ll < Oll022033, 



where 



and 



exp(rnCi M ), x 2 = a21M f' T C2L exp{(o 2 iMXi - c 2 l)t 2 2}, 



C 1M 

g32i gf^ exp{(a 3 2MX2 - c 3L )r 33 }, ■ ' 



and 



flu = OnL - (oiimTii){ohm^i + max(ci M , -c 1L + 012X2)}, 

022 = 2 2L - (022Mr22){a 2 2M^2 + Hiax(-C 2 L + a 2 lM^l, C 2 M + 023^3)}, (40) 

k 633 = o 33 l - (a 33 Mr 33 ){a 33 M^3 + max(-c 3L + a 32 MX 2 , c 3M )}, 



«12 — Ol2M(l + OllM^l), 021 — 021m(1 + 022M^2), ,. -, \ 

«23 — «23m(1 + «22M^2), O32 = «32m(1 + 033M^3)- 



Then any positive solution of system (35) is globally asymptotically stable. 

Proof. Eq.(36) implies the permanence of system (35) and hence Eq.(16) holds (see 
Xu and Chen [11]). Since 3x3 tridiagonal matrix A = [oy] is an M-matrix, if and only 
if, 011,022,633 > > 612,021,623,632 and (612/an) (621/622) + (623/622) (6 32 /6 33 ) < 1, for 
this system (35), we can easily prove that Eq.(17) holds if and only if Eq.(38) holds (see 
for example, Berman and Plemmons [3]). Thus, by Theorem 1, we obtain the conclusion. 
□ 

Note that Eqs.(37) and (38) are different type conditions from each other. 
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ABSTRACT. In the first part of this article, a stochastic model for the average case 
analysis of roundoff errors is developed. Starting from the fractional roundoff error 
of an algebraic operation a Wilkinson type model Q(X o Y) = X o Y + e°\X o Y\ 
for the relative roundoff error e° is derived for real valued operations in §1 and for 
complex operations in §2. The analysis of alternative implementations of complex 
multiplication shows that the standard method has roundoff errors of the smallest 
average size. In §3 this model is applied to the error analysis of the matrix-vector 
product. The average Euclidean norm of the roundoff error vector as well as the 
covariance between its components is determined. In the special case of Givens 
rotations these components are uncorrelated and the same holds for reflections. For 
rotations, the standard method of the matrix-vector product is compared with the 
representation by three lifting steps which enjoys roundoff errors of smaller average 
size if the rotation angle is small, however at the expense of a correlation between 
the components. 

In the last part, these results are applied to the discrete Fourier transform and the 
discrete cosine transforms of types II and III in the case n = 8. Taking into account 
the precomputation errors of the coefficients of the trigonometric matrices, both the 
direct method and a number of fast algorithms are studied. The latter show roundoff 
errors of only slightly larger average norm. In all algorithms the components of the 
roundoff errors are either uncorrelated or have covariances of only small size. The 
results are confirmed by simulations. 



0. Introduction 

When an algorithm which uses a large number of floating point operations is 
implemented, it is important to analyse the size of the roundoff errors in order to 
ensure the accuracy of the result. By a worst case analysis an upper bound of the 
error is obtained. Usually, this bound is much larger than the "typical" roundoff 
error of the algorithm: whereas the worst case error grows like the number of steps 
in the algorithm, in simulations often an error is observed which is proportional 
to the square root thereof (see [7], pp. 52-53). When we have to choose between 
different algorithms for the same problem, then in the case of equal complexity the 
average size of the errors would be more conclusive than a theoretical upper bound. 

The average case analysis attempts to obtain a more realistic estimate for the 
roundoff errors by applying a stochastic model for these errors. Certainly, roundoff 
errors are not random, since the computer will produce the same result again, and 
hence the same error, when an operation is repeated with the same operands. But, 
as in other areas of applied mathematics, we develop a model of the roundoff errors 
which well describes the effects observed in the data. It will turn out at the end 
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that the predictions of our stochastic model for the roundoff errors are in good 
accordance with the results of simulations. 

In the early days of the computer, average case analysis of roundoff errors has 
been mainly used to evaluate the benefits of different hardware concepts and imple- 
mentations of the basic floating point arithmetic (see [3]). For special distributions 
of the data, the statistical properties of the roundoff errors in arithmetic expres- 
sions have been studied. Applications to the a posteriori average case analysis of 
the fast Fourier transform (FFT) are proved in [4] and [1] for the Cooley-Tukey 
resp. Gentleman-Sande algorithms - under the assumption that the roots of unity 
used in the FFT have been precomputed exactly. It turns out that these precom- 
putation errors have a determinative influence on the accuracy of the result (see 
[11]). In [13], an average case error analysis for the Cooley-Tukey FFT is given 
which includes the treatment of the precomputation errors. 

In the first half of this article we lay the ground for a simple but realistic model 
for the a priori average case analysis of the roundoff errors in floating point arith- 
metic. It is valid both for real and complex valued data. In Section 1 we derive 
a Wilkinson-type (but stochastic) model for the relative error in the case of real 
valued arithmetic, which is based on properties of the fractional error as defined 
in [3]. For multiplication (and division) the distribution of the relative error can 
be given in closed form. The dependency from the distribution of the data is only 
very small. In contrast, for summation, the distribution of the data has a strong 
influence on the distribution of the relative error. A formula for the expectation of 
the relative error is proved under various rounding modes, and it is shown that the 
expectation is zero for the default rounding mode in the IEEE 754 standard. 

In Section 2 properties of the relative roundoff error e° := °\zlw\ — ^ or com ~ 
plex operands Z, W are derived from the stochastic Wilkinson model in Section 1. 
For all operations o, the real and the imaginary parts of the error are uncorrelated 
and have symmetric distributions with equal variances. For addition and multi- 
plication the variance of the complex error does not depend on the distribution of 
the data, but only from the first two moments of the real valued relative errors. 
In the case of division there is only a weak dependence on the distribution of the 
data. Alternative methods for complex multiplication are shown to have inferior 
accuracy. 

In the second half of the article the stochastic model for the actual roundoff 
error is applied to matrix-vector multiplication and the discrete Fourier and cosine 
transforms (DFT and DCT) in particular. In Section 3 we state our final model 
which is based on the preliminary models of Sections 1-2. The main assertion of 
the model is that the relative roundoff error e° has expectation and variance o~^ u 2 
and is independent of the data. The constant o~ 2 only depends on the operation 
o and the underlying number field K = 1 or C but not on the machine accuracy 
u (see (1.1)). Motivated by the results of section 2 we also assume that a 2 is not 
dependent on the distribution of X and Y, as long as the data has expectation and 
is uncorrelated. Based on this model, we determine the variance of the roundoff 
error for the inner product of two vectors and the product of a matrix with a vector 
under different summation methods. It is possible to generalize this to data with 
non-zero expectation, but the statement of the result is more complicated. 

If entries of the matrix carry precomputation errors, several additional problems 
have to be considered: in many practical examples, the same constant occurs in 
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several entries of the matrix. The corresponding roundoff errors are clearly de- 
pendent. Moreover, multiplications with identical operands - and hence identical 
errors - enter in different components of the result, for example in the butterfly op- 
erations of the FFT. As a consequence, the components of the total roundoff error 
of a matrix-vector product are no longer independent. A general formula for the 
covariance matrix of the error vector in this situation is the main result of Section 
3. If we are only interested in the mean square norm of the error vector (and not 
in the componentwise analysis), the result can be simplified considerably. 

In the Section 4 we apply this general result to the discrete Fourier transform, 
the fast Cooley-Tukey and Gentleman-Sande algorithms, the discrete cosine-II and 
-III transforms with two new fast algorithms from recent research by Loeffler et al. 
[9] and Schreiber [12]. We restrict the study to the special case n = 8, since here 
the effect of correlation between error components and non-equal variances is more 
pronounced than for larger n. In all these examples, we observe that the correla- 
tions between the components of the error vector - if non-zero - are always small 
compared to the variances, but that the size of the variance of different components 
can vary considerably. The theoretical results are confirmed by simulations. 

1. Wilkinson's roundoff error model revisited 

The standard representation for roundoff errors occuring in computations with 
floating point numbers is Wilkinson's model, which gives the following estimate of 
the computed value of an operation o £ {+, — , x, -=-} 

fL(xoy) = (xoy)(l + e°) (x,y€R). (1.1) 

The relative error e° satisfies the worst case bound |e°| < u, where u is the maximal 
relative machine error of the implementation of the arithmetic operations (depend- 
ing on the CPU). If the mantissa in the floating point representation of a real 
number uses m + 1 bits (not including the sign), then u = 2 _m_1 . Here and in the 
following we will neglect the possibility of an underflow or overflow. 

This model is based on the assumption that the computed result of an operation 
is obtained by rounding the precise result z = x o y (or at least a more precise 
result) towards the nearest machine number 

&(z) = z(l + e) (|e|<«). 

A sharper and more realistic (but more complicated) model which is based on the 
representation of floating point numbers, follows from the assumption 



with 



&(z) = z + ((z)r) (\v\<u) 

sgn(z) max{2 n < \z\ : n G Z} for z/0, 
for z = 0. 



Here, r/ is the fractional error (see [3]). It implies 

fl(x o y) = x o y + Q[x o y) rf (x, y £ K, \rj°\<u) (1-2) 
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The difference between the implications of these models, however, is only small (see 
Remark 2.2). In the rest of the Section 1 our results will be based on the model 
(1.2) and we will study the consequences for (1.1). 

Following Calvetti [4] we begin to build a stochastic model for the roundoff errors 
by assuming that the relative errors e° in (1.1) from an operation o are random 
with expectation 

E(e°)=/i n (1.3) 

and variance 

V(£°) = a 2 u 2 . (1.4) 

The constants n o ,0~ o £ R will depend on the operation o £ {+, — , x, -r} and the 
distribution of the data. In practice the result of a particular computation, and 
hence the roundoff error, is (hopefully) not random. Therefore we explain the 
randomness of e° as a consequence of the randomness of the input data. In this 
model we will assume that the input is random and that the relative error e° is 
independent of the data. Our simulations are in accordance with relative error and 
data being uncorrelated, but since the roundoff error is a function of the data, our 
stronger assumption of independence is certainly a simplification. It is motivated 
by the observation that the size of the relative error e° is not related to the size of 
the data, and that the last digits of the data, which have the strongest influence on 
the roundoff error, are approximately independent of the rest of the mantissa. 

Calvetti [4] has observed that in numerical experiments ^+ < is obtained for 
the addition, whereas fi x is close to zero. In her simulations the roundoff error is 
determined by rounding the exact result to a given precision. In our own exper- 
iments (using MATLAB on a Macintosh computer and an SGI workstation, and 
verifying 2 • 10 6 calculated values using long arithmetic) no statistically significant 
evidence against the hypotheses /i + = and fi x = was found (see Remark 1.8). 
It is clear that the implementation of the basic arithmetic operations in different 
hardware or software will influence the distributions of the roundoff errors. It is 
therefore possible that the "drift" towards zero for the addition observed by Cal- 
vetti originates from the treatment of ties in her model, i.e., a result x o y which is 
exactly the mean between two consecutive machine numbers. In her experiments 
these are rounded towards the smaller neighbour by the usual round function. Fol- 
lowing the IEEE 754 standard (see [7], pp. 46-48), an arithmetic operation is 
performed to arbitrary precision and then rounded towards one of the adjacent ma- 
chine numbers using one of four methods. In the default rounding method, ties are 
rounded towards the nearest even mantissa number. Lemma 1.3 below will explain 
the discrepancy between Calvetti's model and ours. 

In the following we will study the probability p° of ties for addition and multi- 
plication and its influence on the expectation fi°u of the roundoff error. We denote 
by m + 1 the number of bits in the mantissa (not counting the sign bit). The 
probability p + of a tie in the computation of a sum (or difference) depends strongly 
on the distribution of the operands. 

1.1 Lemma. Let X and Y be independent with 2 : >~ 1 < X < 2 J and 2 k ~ 1 <Y<2 k 
(j, k £ Z). Assume that the \j — k\ + 1 last digits in the binary representations are 
independent of the rest and uniformly distributed. Then the probability of a tie in 
the calculation of X + Y is 

p+ = 2 -l?- fc l- 1 (l + P{X + Y < 2 max (j< fe )}). 
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The calculation of X — Y yields no roundoff error if j = k, and the probability of a 
tie is 

+ J \P{\X -Y\> 2 max 0»- 1 } for j = k ± 1, 

'" " [ 2~\i- k \{l + P{\X -Y\ < 2 max 0'' fc )- 1 }) for\j-k\>2. 

From these special cases the probability p + of a tie for arbitrary distributions of 
X and Y can be deduced. We observe that the probability of a tie can be as large 
as 50%, but very small probabilities are possible if X and Y are of very different 
size. Note that the assumption of independence is at least approximately true as 
long as \j — k\ is much smaller than m. Furthermore, the different behaviour of 
addition and subtraction stems from the assumption X, Y > 0. 

Proof Without loss of generality we may assume that k < j = 0. Let X\ and 
Y\ be the Z-th digits in the binary representations of the mantissae of X and Y . 
Then we have a tie in the computation of X + Y if and only if X + Y > 1 and 
X m +i + Y m+ i + k = l,Y m+2 +k = ■ ■ ■ = Y m+ i = 0, or if X + Y < 1 and Y m+2 +k = 
1, Y m+ 3 + k, . . . , Y m+ i = 0. Since we have assumed independence, the probability of 
the first event is P{X + Y > 1} 2 k ~ 1 and P{X + Y < 1} 2 k of the second. Note 
that X + Y < 1 is only possible for k < — 1. The sum of these probabilities is 
2 fe_1 (l + P{X + Y < 1}), and so the first equation follows. 

For k < —1 a tie in the difference X — Y occurs if and only if \X — Y\ > \ and 
Ym+2+k = l,^m+3+fc) • • • , Ym+i = 0, or if A; < —2, \X — Y\ < ^ and Y m+3+ k = 
1, Y m +4+fc, . . . , Y m+ \ = 0. This implies the last two equations. ■ 

1.2 Lemma. If the density of the mantissa of X resp. Y (i.e. the probability that 
the mantissa is a particular number) is bounded by a/2 m+1 resp. b/2 m+1 , then for 
the multiplication the probability p x of a tie satisfies 

v - 8 

For example we can choose a := \\fx\\oo an d b := ||/y||oo if X and Y are the 
machine approximations of random variables with densities fx resp. fy and values 
in [|, 1]. In particular if X and Y" are uniformly distributed in [|, 1], [0, 1] or [— 1, 1], 
then a = 6 = 2. 

Proof. Without loss of generality let | < X, Y < 1. Let L resp. M be the number 
of trailing zeros in the mantissa of X resp. Y. Then 



^ + 1 1 - om /.• I .. /o/» . _ o /.• -2 



^ O 7T1. — fc — 1 



and P{M = A;} < 2~ k ~ 2 b for fc = 0, . . . , m — 1. A tie occurs if and only if 
L+M = m — 1 and J7 > |, or if L+M = m — 2 and X7 < | (after normalization). 
Therefore the probability of a tie is bounded by 

P{m -2<L + M<m-l} 

= Y^ P{ L = k}P{m -k-2<M<m-k-l} 

k=0 

771—1 

<^2- k - 2 a{2- m+k +2~ m+k - l )b = 3mab2- m - 3 . 

k=0 
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This completes the proof. 



1.3 Lemma. Let o 6 {+,-, x, -=-} and q := E(((XoY)/(XoY)) (see (1.2)). Then 



Mo 



1 for nearest neighbour rounding with ties rounded to even, 

-p°q for nearest neighbour rounding with ties truncated, 
-q for truncation. 



The constant q depends on the distribution of the data, but only slightly: if XoY 
is standard normal, then q = ^vr)" 1 / 2 Y, n(z i 2~ n (exp(-2 2n - 1 ) - exp(-2 2n+1 )) « 
0.721; if X o Y is uniform on [-1, 1], then q = In 2 « 0.693. 



Proo/. Let Z := J o y and i? := Z/2((Z) - u[Z/2u((Z)\ the part of the man- 
tissa which would be lost with truncation. Since < R < u, the distribution 
of R is symmetric around u/2 (with the exception of R = 0) and hence the first 
result follows. For truncation we have e° = —2R((Z)/Z and /j, u = E(e°) = 
—2E(R)E(£(Z)/Z) = —uq by the assumption of independence between relative 
error and data. For the second rounding method we have to replace E(R) by 
E( J Rl { i ?=u/2} ) = fP{ J R=f}=p°f. ■ 



In the following we will assume that the default rounding method of IEEE 754 
standard is used. Hence fi = 0. For summation the error distribution depends 
strongly on the distribution of the data (see Lemma 1.1 and the following his- 
togramms of roundoff errors in Figures 1-2) 
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Figure 1: Distribution of roundoff errors e^/it from summation of standard Gaussian data (left) 
and data distributed uniformly on [1, 2] (right), with 100000 samples. 



In contrast, one observes that the distribution of the relative error for the multipli- 
cation has a very similar shape for a large class of input distributions. Simulations 
show that this shape can very well be described by a density: 
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Figure 2: Distribution of roundoff errors e x /u from multiplication of standard Gaussian data 
(left) and data distributed uniformly on [1,2] (right), with 100000 samples. 

However, roundoff errors according to our definition are always discrete random 
variables. 

In order to avoid this problem, we assume that the input variables are continuous 
with a smooth density, and rounded to machine number after multiplication. Then 
the roundoff error ft.(XY) — XY also has a density. This modification is motivated 
by the fact that for uniformly distributed X and Y the remainder R (see Lemma 
1.3) takes the value uj/2 l with the same probability for all odd j = 1, . . . , 2 l — 1 
(I = 2, . . . , m + 1). Therefore the uniform distribution is a good approximation for 
the distribution of tL(XY) - XY. 

1.4 Proposition. Assume that X and Y are independent with ^ < X, Y < 1 

3o < 1. Furthermore, let 



and that X has a differentiable density 4> with u\\ 

truncated mo 
yl.l) has the density 9 on [—1,1] given by 



tp(t) := E(yi;y< t }) be the truncated moment. Then the relative error e x /u in 



9{t) = E(X)E(Y)+ I i/>(—)x<f>(x)dz 
Ji/2 \2x/ 



for \t\ < o and 



9{t) = E(Il {2 | t | X < 1} )E(y) 

- 1/2 1 1 



1/2 



ip[ — )x(j)(x) dx 
\2xJ 



+ 2 



f / ]\ r 1 / 1 \ 

1p[- — r-r ) X(j)(x) dx + / Ipl 7-r}X(j){x) dx 

1/2 \Ax\t\J h/2\t\ V2x|t|/ 



for\t\>\. 

The assumption u\\4>' 
approximated by 9 with a 0{u\ 



^C 1 means that the exact density of e x ju can be 



-error term. 



Proof. Under our assumptions the conditional distribution of Q.(XY) — XY given 
X,Y is the uniform distribution on [-§ , f] for XY > \ and [-f , f] for XY < 



|. Hence e x ju has the conditional density XY\\ \ i_i for XY > \ and 

2XYl< i i , 

L 4X Y ■ 1AT - 



2XY ' 2XY ' 



for XY < -=. This holds up to an error of at most u\ 
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Therefore the value 9{t) of the density of e x /u is 



// 

J J X 



^xyl [0 i ] (xy)P v (dy)(f)(x)dx+ // xyl [0 i ] (xy)P Y (dy)4>(x)dx 



a;y<i J J xy>\ 

pi. 11 /* 1 1 1 

2 / 1/ /( min (^'4^))^ (x)da;+ y 1/2 k(^)-^(^ 



x<j){x)dx. 



For |£| < i the value of the minimum in the integral above is ■£- and ^(ottt) = 
ip(l) = E(y). For |t| > g the minimum is ^^ur and for 2x\t\ < 1 we have VK^^nr) = 
E(Y). In both cases the result follows by a simple calculation. ■ 

1.5 Corollary. Under the same assumptions as in Proposition 1.4 Zei x(£) := 
K(Y~ 2 l{Y<t})- Then the variance of the relative error e x in (1.1) is given by 

1.6 Example. If X, 1" are uniformly distributed on [1/2,1] (or [0,1] or [—1,1]), 
then the density of e x /u is equal to 

r | + 5 In 2 « 0.722 for \t\ < \ , 

m= \^ + ( l -¥ + l)(l-l) forh<\t\<l. 



The standard deviation is a x = w| In 2 — „- ~ 0.424. 

1.7 Remark. For the division of real numbers, Proposition 1.4, Corollary 1.5 and 
the density in Example 1.6 are also valid, since the same arguments are applicable. 

1.8 Remark. By experimenting with standard normal data X, Y in MATLAB we 
have obtained the following values as multiples of the machine accuracy u (which 
is u = ^eps = 2" 53 for MATLAB): \fi + \, \/i x |, |/z_=_| < 0.001, a + « 0.406 and 
a x ~ 0.425 ~ U-=-. The results in [4] about /j, + and fi x are explained by Lemmas 
1.1-1.3. 

For uniform data on [0, 1], <7-|_ ~ 0.470 was obtained and a+ ~ 0.192 for the uni- 
form distribution of [—1, 1]. The influence of the distribution on the roundoff error 
from summation is obvious. For multiplication no such effect has been observed. 

1.9 Remark. The subsequent results remain valid even when fi + ^ or /i x ^ 0. 
However, if we replace the definition of the relative error in (1.1) by 

R(XoY) = XoY + e°\XoY\, (1.5) 

then it is easy to see that E(e°) = and V(e°) = a% + nl for any operation o e 
{+, — , x, -=-}, as long as |XoY| and sgn(XoY) arc independent and fl(— x) = —&(x) 
for all x G R. 
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2. Complex arithmetic 



Starting now from the stochastic version of Wilkinson's model (1.1) for real 
valued arithmetic operations, we will derive properties of the distributions of the 
relative roundoff errors for operations with complex data. These will be stated in 
the form of (1.5) which is more convenient for later use. 

2.1 Proposition. Let X, Y, U, V be independent real-valued random variables with 
identical distribution symmetric around the origin. Set Z := X + \Y , W := U + iV. 
Assume that 

&(X + U) = (X + U)(l + ef), 
&(Y + V) = (Y + V)(l + e+) 

where £f,£% are independent of Z and W and with E(e^) = /U+u, V(e~^) = a+u 2 
for j = 1,2. Define £ + by 

fl(Z + W) = (Z + W) + £ + ■ \z + w\. 

Then we have: 

(i) The distribution of £ + is symmetric. In particular, E(e + ) = 0. 

(ii) Ree + , Ime + and \Z + W\ are uncorrelated. 

(hi) We have V(e + ) = (//+ + c+)u 2 and Ree + and Ime + have equal variances 

V(Ree+) = V(Ime+). 

Proof, (i) From the definition we have 

+ _ (X + U) • £+ + \{Y + V) • 4 



^f{x + uy + {Y + v) 2 

By independence and the symmetry of the distributions of X, Y, U and V the dis- 
tribution of —£ + = - — . £l = £?- is the same as that of e + . 

y/(-X-U)' + (-Y-V)2 

(ii) WehaveCov(Ree+,|Z+VF|) = E((X+U)-ef) = E(X+U)-E(e+) = 0-//+U = 
and similarly for Cov(Ime + , \Z + W\). Furthermore 

Cov(Ree + ,Ime + ) 

= E ( ^±^ e+ ^±^ £ + 

\^(x + uy + (Y + vy x ^(x + uy + iY + vy 2 
-n +u ^y {x + u)2 + {Y + vy 

again by symmetry. 

(in) Since E(e + ) = 0, we have 

\2 



iwTWTWTw)- E{{4f ^> l + 



+^ --'- 2 ^ai)u 2 . 
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2.2 Remark. A more realistic model would be obtained by replacing the ef from 
(1.1) with the fractional relative roundoff error i]f from (1.2) in the assumptions in 
Proposition 2.1. Since by definition E((r?+) 2 ) = E((e+) 2 ) E((X + U) 2 /((X + U) 2 ), 
we obtain 

v , - F ( ax + u) 2 + aY + v) 2 \ nx + u) 2 \ 

instead of the result in (iii) above. The product of the two expected values (which 
depends on the distribution of the data) is very close to 1 for a large class of 
distributions. This justifies the use of (1.1) in the following. 

In the next proposition we study complex multiplication. Here and in the fol- 
lowing we assume that roundoff errors propagate without further alteration (i.e., 
they have no influence on further roundoff errors). 

2.3 Proposition. Let X,Y, U,V be real valued random variables such that X and 
Y have the same distribution which is symmetric around the origin, and set Z := 
X + \Y, W :=U + iV. Assume that 

Q(X x U - Y x V) = {X x U - Y x Y)(l + ef) + (X x E/>* - (Y x V>£ 

and 

Q(X x V + Y x U) = {X x V + Y x 17) (1 + e+) + (X x Y)e x + (Y x C/)e£ 

mt/i X, Y, IV, ef ,££ ,£3 ,£4 , e^e^ independent and E(e x ) = /x x u, V(e*) = <t 2 u 2 
(j = 1,2,3,4) and E(et) = ^ +w , V(e+) = a 2 u 2 (j = 1,2). //we de/me e x by 

&(Z x W) = Z x W + e x \Z x W\, 

then we have: 

(i) The distribution of e x is symmetric. In particular, E(e x ) = 0. 

(ii) Ree x , Ime x and \Z xW\ are uncorrelated. 

(iii) Ree x andlme x have the equal variances V (Re e x ) = V(Ime x ) and V(e + ) = 

{o-i+a\ + nl+ix\)u 2 . 

Proof, (i) This follows from 

Ree x = ; • (ef + ef) ; • (e, x + ef) 

VX 2 + Y 2 VU 2 + V 2 V x : ^ v / ^ 2 + Y 2 v / f/ 2 + Y 2 V 2 x 7 



and 



AY £+ YU 

VX 2 + Y 2 VU 2 + V 2 3 + £2 + v 7 ^ 2 + Y 2 VC/ 2 + Y 2 



Ime x - j== == ,__„ = • (e£ + ej) + ,__„ == ,__„ = • (e% + e^ 



as well as the fact that the distribution of e x = Ree x + ilme x remains the same, 
if we replace (X, Y) by (-X, -Y). 
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(ii) We have Cov(Ree x ,|Z x W\) = E(XUe? - YVe% + (XU - YV)ef) = 
E(X)E(U)E(sf) -E(Y)E(V)E(e%) + (E(X)E(U) -E(Y)E(V))E(ef) = and sim- 
ilarly for Ime x . Furthermore, Cov(Ree x ,Ime x ) equals 

/((XU - YV)ef + XUe* - YVe^) {{XV + YU)e% + XV e$ + YUe% 

E \ (X 2 + Y 2 )(U 2 + V 2 ) 

The terms going with E(e^)E(e^") are 

/ X 2 \ ( UV \ ( XY \ ( U 2 

e( — n . ^ m -= . -, ) + E(— — - )E 



X 2 + Y 2 J \U 2 + V 2 J \X 2 + Y 2 J \U 2 + V 2 

XY \ / V 2 \ ( Y 2 \ / UV 

E — - — — T ]E( — — tttt) - E(— - — — r)E( 



X 2 + Y 2 J \U 2 + V 2 J \X 2 + Y 2 J \U 2 + V 2 

and by our assumptions the first and the last terms cancel, whereas the two middle 
terms vanish because of E( x2+Y ^ ) = 0- Here the symmetry of the distribution of 
X is used. The same terms also occur in conjunction with E(e x )E(4) = fj^u 2 
(where j = 1,2, k = 3,4) and with the mixed terms E(4)E(4) = /j> + /j, x u 2 (where 
j = 1, k = 3, 4 as well as j = 2, k = 1,2) and therefore Cov (Re e x , Im e x ) = 0. 
(iii) This is a similar calculation as in (ii). ■ 

2.4 Remark. It is possible to use only three multiplications (but five additions) 
by defining a := (X + Y) x (U — V), (3 := X x V, 7 := Y x U and using the identity 

(X + iY) x(U + iV) = (ct+(j3- 7)) + i(/3 + 7)- 

Whereas the usual complex multiplication is based on the formula 

XU -YV\ _ f X -Y\ fU 
XV + YU J ~ \Y X )\V 

with an almost orthogonal matrix, the three-plus-five algorithm relies on 

XU-YV\ (I 1 -1\ ( X+ J I ° X /l ' 
XV + Yu)-{0 1 l)[ I X Q 

with non-orthogonal matrices. Therefore it is not surprising, that in terms of 
numerical stability this method is worse than the usual method. In fact, under the 
same assumptions on the data as in Proposition 2.3 one can prove that the relative 
error 

_ fl((q + Q3- 7)) + i(0 + 7)) ~{X + xY){U + IV) 
\{X + iY)(U + iV)\ 

^ q(e+ + et + 4 + 4) + gg| ~ 74 + (P ~ 7)(4 + 4) 
\(X + iY)(U + iV)\ 

+ 1 \{X + iY)(U + iV)\ +U ^ U > 
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(where e* , . . . , £5 are the relative roundoff errors from the successive computations) 
satisfies E(e x/ ) = 0, 



V(Ree x/ ) 

V(Ime x ') 



3<7+ + -<Tx + 5^+3/i+/i x + - / u 2 x )n 2 + 0(u 3 ), 



2^+ 



Cov(Ree x ',Ime > 



+ 0- 2 . + ^+^ 2 x )u 2 + 0(\ 
= 0(u 3 ), 



is 



and so in particular E(|e x ' | 2 ) = (|o- 2 r + 2a 2 + ^-fi 2 + + 3^ +/ u x + /x 2 )u 2 + 0(u 3 
larger than for the usual computation of the product of two complex numbers (see 
Proposition 2.3 (hi)). 

By reversing the role of real and imaginary parts we can obtain an algorithm 
with the same complexity and similar error variance. Yet another method with 
the same complexity is described in [10], p. 73 and based on XU — YV = {X + 
Y)U - Y{U + V), XV + YU = {X + Y)U + X(V - U). It leads to a complex 
roundoff error with equal variances of real and imaginary parts but with covariance 



+ a\ + /4 + ^ 2 )-u 2 + 0(u 3 



Therefore they are not uncorrelated as in 
The total variance of the relative error is 



the case of all the other algorithms. 
(3ct 2 _ + 2ct 2 + 3/4 + 2^ 2 )u 2 + 0(u 3 ). 

2.5 Remark. In the following diagrams of Figure 3 we show the results of 10000 
simulated multiplications of complex numbers using the methods from Proposition 
2.3 (left) and Remark 2.4 (right diagram). 



-2 2 -2 2 

Figure 3: Distribution of roundoff errors e x ju for different methods of complex multiplication 

As can be seen from these diagrams the distribution of the complex valued 
relative roundoff error e x is not exactly rotation symmetric, even when Z and W 
are. This is also true for e + . 



If one of the factors of the product is a real number (or purely imaginary) then 
only two real multiplications are needed to calculate the product. As a consequence, 
the roundoff errors are smaller than in the general situation. 

2.6 Proposition. Let X,Y,U be independent real valued random variables such 
that the distributions of X and Y are equal and symmetric around the origin, and 
set Z := X + \Y . Assume that 



&(U x X) 
R(U x Y) 



(C/xX)(l + e x ) 
(£/xY)(l + e x ) 
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with e* ,£$ independent of Z and U and with E(e x ) = n^u, V(e?) = o\u 2 (for 
j = 1,2), and define e x by 

&(U x Z) = (U x Z)+e x \U x Z\. 

Then we have: 

(i) The distribution of e x is symmetric. In particular E(e x ) = 0. 

(ii) Ree x , Ime x and \U X Z\ are uncorrelated. 

(iii) Ree x and Ime x have equal variances V(Ree x ) = V(Ime x ), and V(e x ) = 

(M 2 x +o- 2 x )u 2 . 

The proof is similar to the preceding ones and will be omitted. 

Finally we study division in C We will restrict our attention to the roundoff 
errors in the computation of the inverse 1/Z, where Z is a complex variable such that 
Z/0 almost surely. Unlike the cases of addition and multiplication the variance 
of the roundoff error is only approximately independent of the distribution of the 
data. 

2.7 Proposition. Let X, Y be independent with equal and symmetric distributions 
such that P{X = 0} = 0, and set Z := X + \Y . Assume that 

/ X \ X + 

\X 2 + Y 2 J ~ X 2 e* + Y 2 e x + (X 2 + Y 2 )(l + ef) ( +£lh 

Y \ _ Y _5_ 

X 2 + Y 2 ) ~ X 2 e* + Y 2 £ x + {X 2 + Y 2 )(l + e+) (1 + £ * } 

with X, Y, £*,£$ ,£i ,£i,ej independent, E(e x ) = /J- X u, V(e x ) = (j\u 2 , E(et) 
= n+u, V(e+) = cr 2 ^ 2 /or j = 1,2 and E(ef ) = //_=_«, V(ef ) = a\u 2 . If the 
relative complex roundoff error e~ is defined by 

flf 1 ) = - + £ A 

\z) z \z\ 

then we have: 

(i) The distribution of e~ is symmetric. In particular, E(e~) = 0. 

(ii) Cov(Re^,Ime-), Cov(Ree+,l/\Z\), Cov(Ime-, 1/\Z\) = 0(u 3 ). 

(iii) LetK-.= E((X 4 + Y 4 )/(X 2 + Y 2 ) 2 ). Then V(Ree^) = V(Ime-) and 

V(e" 5 ") = (/so- 2 , + cd + cr+ + /i 2 + /ii + /4 + 2^ x ^+ - 2( / u x + n+)^)u 2 + C(u 3 ). 



Proof. Let o" := — x^+y 2 2 "^ e i • Then it follows from the definition of e • that 



x 2 +y 2 

Ree " = Vx f +y . (4 - *) + o(^ 2 ) and lm ^ - vx ".+ y . (4 - <*) + o(^ 2 )- 

(i) This is a consequence of the symmetry of the distributions of X and Y. 
(ii) The covariances satisfy Cov(Ree~,Ime~) = E( ^^fy 2 (ef" — <5)(eJ — (5)) + 
C(u 3 ), Cov(Ree",l/|Z|) = E( x2 * y2 (e^ - <5)) + C(u 3 ), and the expected values 
are by the symmetry of the distributions of X and Y. 
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(iii) Obviously, V(Ree-) = E( x2 * 2 y2 (gf - 5) 2 ) + C(u 3 ) = V(Ime-). Since 
E(X 2 (X 4 + Y 4 )/(X 2 + Y 2 ) 3 ) = § , the result now follows from 



X 2 f 2 \ _ ( X 2 



E( W^72(4) 2 ) = E( w^72)E((4) 2 ) = ^(4 +4)« 2 , 



X 2 . \ / X 2 /X 2 £l x + Y 2 £ x 



H^T^^J = E {x 2 ^Y 2 -{ V + Y* +e f)h" 
= g(Mx + A*+)A*-5-« 2 » 



e (x*Ty* s *) = (i^ + \& +fixfi+ + l(°+ + t*l) 



u 



The constant n depends on the distribution of X and Y and takes values in the 
interval [g, l] ■ For the standard normal distribution we obtain n = |. 

2.8 Conclusion. It follows from Propositions 2.1, 2.3, 2.6 and 2.7 that the roundoff 
error A° := &(X oY)-XoY = \XoY\e° satisfies 

E(A°) = 0, V(A°) = E(|Xoy| 2 )E(|e°| 2 ) (2.1) 

for all arithmetic operations o e {+, — , x, -i-}. 

This is essentially a consequence of the assumption that the data has expectation 
zero and is independent of the relative error. It holds even when n° ^ 0. But (2.1) 
is valid even without the assumption on the expectation of the data, if we know 
E(e°) = fj = instead. 

In simulations, the standard deviations of the roundoff errors e + and e x are up 
to 10% larger than to be expected from Propositions 2.1, 2.3, 2.6 and 2.7. Partially 
this is due to the effect observed in Remark 2.2, partially to the fact that, although 
the result of an operation and the roundoff error are uncorrelated, our stronger 
assumption of independence is only approximately satisfied. 

3. Stochastic roundoff model 

Motivated by the propositions in the Section 2 and Remarks 1.9 and 2.8 we will 
describe the roundoff errors in operations with real or complex numbers by the 
following model: 

3.1 Model. For each roundoff error A = fl(A A oY) — XoY = \ X o Y\ • e° occuring 
during the computation of an elementary arithmetic operation o in K = K or C we 
make the following assumptions: 

(Ml) The relative error e° is independent of the data (and in particular independent 
of the operands). 

(M2) The relative error has expectation E(e°) = and variance V(e°) = c 2 o u 2 . 
Therefore the total error A° has expectation and variance 

E(A°) = 0, V(A°) = \\X o Yf a 2 a u 2 , (3.1) 

where \Z\ := Ed^j 2 ) 1 ' 2 denotes the L 2 -norm of a random variable Z. The constant 
o\ depends on the operation o and the number field IK. It follows from the results 
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in Sections 1 and 2 that o^ + = <r 2 + = a+ +//+, cr 2 x = a\ + ^ 2 and <7 C 2 X = cr 2 + c+ + 
A*x + M+j where the basic constants <r and /i (o = + or x) are defined in (1.3) 
and (1.4). In the case IK = C of complex operands we further assume that Ree° 
and Ime° are uncorrelated and V(Ree°) = V(Ime°). This is motivated by parts 
(ii) and (iii) of Propositions 2.1, 2.3, 2.6 and 2.7. 

(M3) We assume that all relative roundoff errors e~ occuring within one step of an 
algorithm are independent. This implies that all the total roundoff errors Aj are un- 
correlated. In fact, if the vector X denotes the data, we can write each roundoff er- 
ror in the form Aj = Ejfj(X) and therefore Cov(Aj, A&) = E(ejfj(X)ekfk(X)) = 
E(ej)E(ek)E(fj(X)h(X)) = 0. 

(M4) We furthermore assume that all the input variables are uncorrelated with 
known expectations and variances. We then also know the £ 2 -norm 



|X| = (V(X) + |E(X)| 2 ) 



1/2 



If the data is complex valued, the real and imaginary parts have to be uncorrelated 
with equal variances. 

(M5) Finally we assume that roundoff errors occuring in one step of the algorithm 
are propagated linearly through the later steps. In particular, we neglect terms of 
order 0(u 2 ). 

3.2 Remarks. The last part of condition (M2) in the complex case is obviously 
equivalent to E([e°] 2 ) = E([Ree°] 2 - [Ime°] 2 ) +2iE(Ree° Ime°) = 0. Similarly, 
the last part of (M4) is equivalent to E(X 2 ) = [E(X)] 2 for each component of the 
data. By a simple calculation we see that this property is preserved for every linear 
combination of uncorrelated components. Therefore, by (M4), for any algorithm 
proceeding in a series of matrix vector products, the condition on the real and 
imaginary part holds for all intermediary results. 

If an algorithm consists of several steps, then the assumption that the data are 
uncorrelated has to be checked for the intermediary result used in each step. For 
many algorithms each step consists in the multiplication of the data with a unitary 
matrix (or a multiple thereof). If the data are uncorrelated and all variances are 
equal, then these two properties are preserved in each step. Note that our model 
cannot be applied directly and has to be adapted if one of the algorithms in Remark 

2.4 for the complex multiplication is used. Nonunitary algorithms often show more 
irregular roundoff error distributions (see for example [2], Figure 3 and Remarks 

2.5 and 3.10 below). 

In condition (M2) of Model 3.1 we assume that the constants o~^ do not depend on 
the distribution of the data. For multiplication (and to a large extent for division) 
this is motivated by Propositions 1.4 and 2.3, whereas for summation we argue that 
the distribution of the exponent and sign of data (which essentially determines the 
roundoff errors) is well approximated by what would be obtained for standard 
Gaussian data - at least for most of the intermediary results of an algorithm. 

Instead of the independence assumptions (Ml) in our Model 3.1 it is also possible 
to use the simpler but less realistic condition that the absolute error A° is uncorre- 
lated with the data. This makes the calculations much easier and in the end leads 
to the same results. As the result of an average case study of a particular algorithm 
we usually state the variances of the components of the roundoff error vector and 
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its covariance structure. For specific values of the data it is quite possible that the 
roundoff error is much larger or smaller. 

In many cases, in addition to (M4) we will impose the assumption that the data 
has expectation zero. Then the norm fX\\ is equal to the variance V(Z). The 
general formula 

\\X + Yf = \\X\\ 2 + ||y|| 2 + 2Re(E(X)E(F)) (3.2) 

for uncorrelated variables of arbitrary expectation then takes the simpler form 
|X + y| 2 = |X|| 2 + |y| 2 , which alleviates the computation of the norm of the data 
in subsequent steps of an algorithm. The assumption that the data has expectation 
is valid for many algorithms (e.g. JPEG). However, the assumption that the data 
are uncorrelated is essential for our results, and so the analysis of a compression 
algorithm, where it is the main point that the data is not uncorrelated, would 
require a more complicated model. 

3.3 Remark. It is interesting to note that for independent U, X, Y in IK = R or C 
with E(X) = or E(Y) = 0, the variance of the roundoff error in the calculation 
of UX + UY = U(X + Y) is a 2 x (\\UX\\ 2 + ||E/r|| 2 )u 2 + a 2 + \\UX + UYfu 2 if we 
calculate the left hand side (using two multiplications and one addition), and the 
variance is a 2 x \\U(X + Y)\ 2 u 2 + |£/| 2 ct k 2 + |X + Y| 2 u 2 for the roundoff error of 
the right hand side. Although the number of multiplications is different, the two 
variances are equal. The same holds for the use of the distributive law with more 
than two summands. Therefore with respect to roundoff errors in a matrix- vector 
multiplication, it matters little if we exploit the equality between matrix entries of 
the same row to reduce the number of multiplications (as long as the data have 
expectation 0). 

As an auxiliary result needed for the study of the roundoff error of a matrix- 
vector calculation, we investigate properties of the "inner product" x T y = ^™ =0 x jUj 
of two vectors x, y E C n . 

3.4 Lemma. Let X = (Xj)™~ and Y= (Yj)""~ be independent random vectors in 
K = R or C with uncorrelated components satisfying condition (M4) and E(Yj) = 
for all j = 0, . . . , n — 1. Then we have 

(i) The roundoff error A occuring in the recursive calculation of 



n-l 

Y^ XjYj = (((X Y + X1YJ + X 2 Y 2 ) + ...) + X n _ t Y 7 



n-l 



has expectation zero and variance 



-i 



'(A) = ((n - l)|X | 2 |y | 2 + E( n " J')l*il 2 l^ 



u 



n-l 



(ii) For cascade summation the variance of the roundoff error is 

n—l n—1 

V(A) = (riog 2 n] £ ||A,| 2 |yj 2 - r)a 2 + u 2 + £ IXJ 2 ^! V n 

3=0 ' j=0 



2 u 2 
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where 

[log 2 n] -2 



r := 



E 



s = 
fn/2 s ] odd 



is the sum of the norms r s of the term which is left out in stage s of the cascade. 

Proof, (i) Since E(XjYj) = 0, this follows as in [15], Lemma 5.1. 
(ii) The calculation of the products XjYj gives rise to the roundoff errors A* = 
Millie Ilk* of expectation and variance ||Xj|| 2 ||lj|| 2 a 2 x u 2 by (3.1). By (M3) 
the variance of the sum of roundoff errors is equal to the sum of variances. At 
stage s = 0, . . . , |~log 2 n\ — 1 of the cascade the |~n/2 s ] summands are grouped 
in pairs. If |~n/2 s ] is odd, one term, which has variance r s , is carried to the 
next stage unchanged. The remaining pairs are added; the computation of the 
sum ^2j e gXjYj with S C {0, ...,n — 1} produces a roundoff error of variance 
vllEjesX^gu 2 = ^E.^lll^lil^li^ 2 - Hence by (3.2) the variance of 
sum of the roundoff errors at this stage equals (^™ =0 III Xj ||| 2 1|| Yj ||| 2 — rs)o~ 2 + u 2 . ■ 

3.5 Remark. As in the worst case study (see [7], p. 90) the size of the roundoff 
error depends strongly on the order of the summation, and it is clear from (i) that 
more accurate results are obtained, if we start the summation with terms of small 
norms. Likewise, the number r in (ii) depends on the order in which the summands 
in the cascade are combined. In order to obtain a small variance of the roundoff 
error, we try to make r large. For example, if all ||| Xj ||| ||| Yj\\ have the same value 
p, then at all stages s of the cascade where terms are left out, i.e., where |~n/2 s ] 
is odd, we can make r s as large as 2 s p 2 , and therefore the maximum is easily 
calculated as r = (2< log2n > — njp 2 . This leads to a variance of the roundoff error 
V(A) = (n|"log 2 n] + n -2^°^ n ^)p 2 <j 2 + u 2 + np 2 a 2 x u 2 . 

If the expected values of the data are not 0, the situation is more complicated. 
We will analyse the simplest case only. 

3.6 Lemma. Let n = 2* and X = (Xj)™~ and Y = (Yj)™~ be independent 
random vectors with uncorrelated components. Then the roundoff error A in the 



calculation of ^™ =0 XjYj by cascade has variance 

n-l 

V(A) = Y J lX,l 2 l\Y J f(ta 2 + + a i 



2 )u 2 



3=0 

+ 2 J2 <j, k) Re(E(A i )E(y 3 )E(A fc )E(y fc ))a I , 2 + n 2 (3.4) 

3<k 

where c(j, k) := t + 1 — min{s = 1, . . . ,t : [^J = L^JI- 
Proof. Now we use (3.2) to evaluate 

lE X ^l2=El^l2l^l2+2 E M E ( X 3 Y 3MX k Y k )) 
j€S j€S j<k€S 



for S C {0, . . . , n — 1}. Since every compound term E(XjYj)E(XkYk) arises exactly 
c(j, k) times in the cascade, the result follows. ■ 
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3.7 Example. If all products E(Xj)E(Yj) are equal to fj,, then the double sum in 
(3.4) is equal to n(2n— 2— t)\p\ 2 a 2 + u 2 . Here we have used the identity Ylj<k c 0'' ^) = 

5ELi2 s (2 s -l)# = f(2n-2-t). 

In the following result we will study the roundoff errors from a matrix-vector 
product AX. In addition to the arithmetical roundoff errors studied above, we 
also have to consider precomputation errors in the coefficients Aj^ of the matrix 
A. Here we have to take into account that when several entries of the matrix 
are equal, then the corresponding precomputation errors are also equal, as well as 
the roundoff errors from identical multiplications. More generally, we study the 
consequences of 

Q.((ax)y) — (ax)y = a(Q.(xy) — xy) for a € {0, 1, — l,i, — i} and x,y G C. (3.5) 

3.8 Theorem. Let A = (A,-,*)"^ = a(0) + EHi a i AW with a t £ C for I > 1, 
Afl £ {0, 1, -l,i, -i} and A^Af'J = for all < I ^ I' < m, j, k = 0, . . . ,n - 1. 

Let 

m 

A = A<°> + J>, + tf)A {l) 
i=i 

be the matrix with the precomputed entries of A and let X have uncorrelated com- 
ponents X\ of variance p 2 and expectation 0. We assume that the products A- k Xk 
are computed exactly, that the conditions in Model 3.1 are satisfied and that inter- 
mediary results of summations are only used once. 
(i) Then A := Q(AX) — AX has the covariance matrix 

m 

Cov(A) = ^u 2 + p 2 JXh-T) + k|V,V)A (Z) A«* (3.6) 

i=i 

where T, + is the diagonal matrix with entries 

n-l 

s+. = ((„ - i)|A,- | 2 + x> - oi^i 2 ) <y 

fc=i 

in the case of recursive summation. Each a 2 x is either a 2 x or o~ 2 x depending on 
whether m £ R U iR or a\ £ C\(RU iR). For cascade summation, E^~- = 

(|~log 2 n\ Y^k=o \Aj,k\ 2 —r(j))o- 2 + p 2 with r(j) > defined as the number r in Lemma 

3.4 (ii) with Y equal to row j of A. 

(ii) The norm of the error satisfies EdlAH 2 ,) = A;^lE(||^C|||)ii 2 with the constant 



1 1 

^:=^(riog 2 nl||A|||-r)^ + +i^(E(|^| 2 /n 2 ) + |a,|X 2 x )l|A«||l (3.7) 

i=i 

in case of cascade summation. Here \\A\\p denotes the Frobenius norm of the matrix 
A and r := Y^j=o r U)- If n is a power of 2, then r = 0. When all constants ai are 
in R U iR or all are not in R U iR, then all a 2 x in (3.6) are equal and formula (3.7) 
simplifies to 

fc 2 = V(riog 2 nl||A|| 2 -r)a, 2 + +||A-A(°)|| 2 ^ 2 x + VE(|^| 2 /n 2 )||A«|| 2 ). (3.8) 



i=i 
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Proof. By (M3) and the assumption Cov (X) = p 2 I n , the total precomputation 
error AX — AX = Ylh=i ^f^ -^ nas the covariance matrix 

^E(|«5f 4 | 2 )A«Cov(X)A«* = / 9 2 ^E(|^| 2 )A«A«*. 

1=1 1=1 

By (3.5) the roundoff error of the product A^ k X k equals A- k A k , where I is the 
unique non-trivial summand for component (j, k) in the above decomposition of A, 
and we have set A, := \aiX k \ef k for I > 1 and A") := in the case I = 0. There- 
fore the total roundoff errors from the multiplications is A x = Y^iLi -^ A®. Since 
Cov(AW) = \ai\ 2 a 2 x p 2 u 2 I n and (M3), A x has the covariance matrix Cov(A x ) = 

p 2 u 2 Y^iiLi \ a i\ 2,T K,xA A an d the result (i) follows. The contribution of the sum- 
mations is as in Lemma 3.4. By the assumption on the intermediary results, the 
components of these roundoff errors are uncorrelated. 

Since E||X|||) = rip 2 , k 2 is n~ 1 p~ 2 times the trace of Cov (A). Assertion (ii) 
now follows from tr(BB*) = \\B\\ F . ■ 

3.9 Remark. Clearly, if A^ ' is the elementary matrix Ej jk , we have A^'A^' = 

Ea a. In the case of a rotation A = ( . , , ) =cos<^A^ ' +sin(/> A" 'with 

J,J \— sm0 cos<pJ 

A (1) = I 2 , A (2) = ( _° n ) we have A (l) A (l) * = I 2 for 1 = 1,2. The covariance 

matrix of the roundoff error is therefore p 2 (a 2 + u 2 + a 2 x u 2 + E(|e) cos | 2 + |5 sm | 2 )) I 2 - 
If we assume that the relative error e tr in the (pre-) computation of a trigonometric 
function has variance V(e tr ) = a 2 r u 2 then this simplifies to p 2 (a 2 + + a 2 ^ + a 2 r )u 2 I 2 . 

The same is true for reflections A = [ . . ' , ) . If all A^ 3 ' (j > 1) in the 

V sin (p — cos <p J 

decomposition of A are of one of these types, then the components of the roundoff 

errors are uncorrelated. However, in general this is not always the case. 

It is an interesting consequence of Proposition 3.8 (ii) that for the multiplication 

with the conjugate matrix A* the square of the norm of the roundoff error has the 

same expectation as for the original algorithm, although the covariance matrices are 

different in general. For example, this is the relation between the Cooley-Tukey and 

Gentlemam-Sande algorithms, and between the DCT-II and DCT-III transforms. 

It follows from ||-B||f = ||-B*||f for all matrices B. 

3.10 Remark. The multiplication of a vector X £ R 2 with a rotation matrix 
cos d> sin < 



, takes four multiplications and two summations. For 6 S 
sin (p cos <p I 

] — 7r, 7r[ it is also possible to compute AX by only three multiplications but three 

additions using three lifting steps (see [6]), i.e. the (nonunitary) decomposition 

1 J ( s 1 J U l) wheres :=sin^andt := ±=£»* =tan|. If s 
and t are precomputed as s = s + 5 sin and t = t + S tan , then 

1 i\ f 1 0\ (1 i\ _ (s5 tan + t5 sin t 2 5 sin - 2(1 -ts)5 tan \ , 2 , 
o i)\-s i)\o i)~ v 6sin s d^ + tS sin J+ U { u )■ 
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Hence the precomputation errors propagate into the final result as 
, sd tan + t5 sin t 2 S sin - 2(1 - ts)5 tan \ f X x 



5 sin s£ tan + £<5 sin J \X 2 



In general the components of this vector are correlated and have variances V(A^ r ) = 
(2s 2 t 2 + t 4 s 2 + 4(1 - ts)H 2 )a 2 rP 2 u 2 and V(Af) = (s 2 + 2s 2 t 2 )cr 2 r p 2 u 2 . For simpli- 
fication we assume as above that the relative error e tr has variance V(e* r ) = <r 2 r u 2 . 
The arithmetical operations occuring in the first matrix-vector multiplication 
produce a roundoff error A (1 ) with A^ = 0andV(A^ 1} ) = (t 2 a 2 x +(l+t 2 )a 2 + )p 2 u 2 . 
It is propagated into the end result as 

1 l \( l °\aW -a (1) ( cos<t) 

l) \s l)* " A i V-sinc/ 

— (--)— -(io^-rro-™— ■ 

tion produces a roundoff error A^ 2 ) with V(A2 ) = (s 2 (l + t 2 )a 2 x + a 2 _^p 2 u 2 and 
A 2 = 0. It appears in the end result as I I A 2 = A 2 . I I . The last matrix 

° perateS ° n (-, l) (J l) {xl) = {-sX^+ll-Ux,)- The rOUnd ° ff 

error A^ 3 ) has A^ 3) = and V(A^ 3) ) = (t 2 ^ 2 ^ + cr 2 + )p 2 u 2 . 
In total we obtain the error vector 

a = ap - + (; 0(-» ?) ai,,+ (o a<2 ' +a<3 ' 

which has correlated components and satisfies 

E(||A|| 2 )=V(A 1 ) + V(A 2 ) 

6 + 2 cos 2 (f) tan 2 - a 2 r + f 3 + 2 tan 2 - j <j m 2 + + 6 tan 2 - a 2 x )p 2 u 2 . 

For small rotation angles (ft, the roundoff error is less than for the classical rotation, 
since the precomputation and multiplication errors are smaller, but if \(ft\ > ?, the 
variance of the roundoff error from the lifting method is larger. 

As one benefit of the use of lifting steps it is often claimed that they allow 

lossless reconstruction, since the inverse of I I is I I and therefore the 

precomputation and multiplication errors in the lifting step and its inverse cancel 
exactly. However, apart from the innermost lifting, this property only holds if the 
floating point addition x + z (where z = ty) followed by the subtraction of z yields 
exactly x = x again. Let the probability of this event be denoted by p > (from 
simulations with standard normal data one estimates p ~ 0.61, but if z is small - 
this is the case for small angles (ft - the probability p is close to 1). However, if this 
event does not happen, the roundoff errors from the multiplications t ■ x and t ■ x 
are distinct and do not cancel. Hence in the j-th to last lifting step the probability 
for the cancellation of the multiplicative errors is only g J_1 . The precomputation 
errors cancel in both cases (up to terms 0(u 2 )). 
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4. Application to DFT and DCT 



A usual simplification in the average case study of the roundoff error in the fast 
trigonometric transforms is the assumption of independence of the roundoff errors 
from the butterfly operations y\ = xi + UJX2, yi = x\ — ujx 2 (where \uj\ = 1). 
Since here the same product enters into two components, these are obviously not 
independent. This problem has already been noted in [4]. Also, usually for all 
multiplications the same roundoff error model is used, although the trivial factors 
1, — 1, i, — i do not induce any error. 

In order to highlight the influence of these assumptions we will calculate the 
covariance matrix of the roundoff error without this simplification in the case n = 8. 
This case has the most most important applications (e.g. JPEG) and also the effects 
are more pronounced for small values of n. Except in Example 4.3, we always 
assume that the data vector X has expectation and uncorrelated components of 
variance \\Xj\\ 2 = p 2 . 

We begin with the study of the discrete Fourier transform x := F n x, {x £ C n ), 
where F n := n~ 1 ' 2 (uj^ k )^^ =0 with uj = uj n := exp(— 2iri/n). In the following, 
I n will always denote the identity matrix, and J n the counteridentity matrix 
(<5j+fc,n-i)"fc= of order n. 

4.1 Example. First we study the naive algorithm for the computation of the DFT 
X = FgX via the matrix-vector multiplication (\'8Fg)X followed by a multiplica- 
tion with l/\/8. It uses one function call (for cos 4? = sin -£■ = 2 ), ^6 summations 
(both for recursive or cascade summation) and 32 complex multiplications. Note 
that the matrix entries to = 1,uj 2 = — i,u; 4 = —1 and to 6 = i give trivial multi- 
plications. Furthermore, if we exploit the special form of uj = ^-(1 — i), only four 
complex products have to be computed. 
Let A (0) , A (1) e C 8x8 be defined by 

;,;) i (— i) jfc / 2 for j k even, m\ [ for jk even, 



j ' k " \ for jk odd, hk ' 1 (-i)C»fc-i)/2 for jk odd . 

Then we have 

and we can apply Proposition 3.8. If the direct call results in fl( 2 ) = 2 "^ < ^ v ^' 
then the computed value for a\ := uj is Co = w(l + V25"^), which has variance 
2cr^- = 2V((5 %/ ~) = n 2 /6. Here we assume that 8^~ is uniformly distributed on an 
interval of length u. Hence in (3.6) we have V(<5^) + |ai| 2 o"^ x = 2o~J- + o~ 2 x and by 
a simple calculation we obtain A*- ' A^ ' = AK with 

K := Ei^i + E 3i3 , + E 5t5 + Ejj — Ei^ — E 5t i — E 3 j — E-j^ . 

In the case of recursive summation the variance of each component of S + is Y,~j ■ = 
{7+^2 k=1 (8—k) s ja 2 + p 2 = 35<7 c 2 +/ o 2 for j = 1, . . . , 8. In the case of cascade summation 
this has to be replaced by £+■ = 3 • 8a 2 + p 2 = 24<j 2 + p 2 . 

Taking into account the final multiplication with 1/yo, we obtain for the total 
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roundoff error vector A = Q(FsX) — F$X the covariance matrix 



Cov (A) 



/a 




















\ 







a + j3 











-p 
















a 




























a + j3 











-p 


2 2 














a 











p 11 





-0 











a + (3 




























a 







Vo 








-13 











a + (3/ 





(4.2) 



where (3 = g(V((5] 4 ) + |ai| 2 o" I ^ x ) = crj- + ^a 2 x , and a = ^cr 2 + for recursive summation 
resp. a = 3<r 2 _|_ for cascade summation. In particular, since E(|| JCUl) = 8p 2 , 



E( 



|A||1) 



2„.2 



(8a + 4P)p z u 



a +-p 



E 11X11 



u 



4.2 Remark. The precomputed matrix entries are considered random in the com- 
putations above, uniformly distributed between the two next machine numbers. 
But if the arithmetic of the computer is fixed, unlike the effects of the randomness 
of the data, the roundoff error <5 W of a precomputed constant u> remains the same 
every instance the algorithm is called. Therefore, in simulations we obtain results 
which are conditioned on these specific values. This explains the strong irregular- 
ities in the mean error of an FFT for different orders n (see for example [7], Fig. 
23.1 or [13], Fig. 3) which often are much larger than the standard deviation of the 
errors. 

We can therefore obtain a model better fitting the roundoff errors, if we replace 
a t = E(|(5^| 2 ) with the square of the exact error (which, for example in the calcu- 
lation of the constant \\/2. above, is 0.190u 2 for 53-bit mantissa but only 0.041-u 2 



for 24-bit mantissa, compared with the expected value a 2 



12 



Proposition 3.8 



is still applicable under this situation. Naturally, this approach is only possible if 
we are able to compute the entries to higher precision, and it is cumbersome when 
the matrix has a large number of different precomputed entries. If we cannot apply 



this more precise model, we have to use E(|5|" 



-2k u 2 



'/12 for 2 



-fc-i 



< M 



<2" 



(fc€ 



which follows from the fact that in this interval the distance between two 



consecutive machine numbers is 2" 



u. 



4.3 Remark. Let us study for a moment the influence of non-zero expectation of 
the data vector X in the case of cascade summation. If E(_X") = \i = (pjYj = q and 
p 2 := V(Xj), then |X,| 2 = p 2 + \pj | 2 . It follows from Lemma 3.6 that we now have 



3=0 3=0 j=0 

= (24p 2 + 2|| A i|| 2 + 8|/i(£:)| 2 K + 



with jj, = E(-X") 
entry by a k := 



(3' := (1+ 



Hence in (4.2) we have to replace the a in the fc-th diagonal 
li"|| 2 + ~s !/*(&) | 2 )°"<Jf- Likewise, (3 has to be replaced by 



. 3 + 4p 2 



4p 2 



lA t i| 2 +lA t 3| 2 +lA i 5| 2 +l/- t 7| 2 )) { c {/-~^^ a €x)- Furthermore, since the matrix 
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(E(Xj Xk)) . ,_, is no longer a multiple of the identity matrix, additional terms 

outside the diagonal appear, and we obtain 



Cov (A) 



with 7 := 





/a 




















^ 









«!+/?' 





71 





-p 





-71 












0:2 



















) = 






-71 








a 3 + f3' 




a A 


71 








-p 




2 2 
p u 







-p 





-71 





a 5 + p 





71 
























Q 6 







Vo 


71 





-p 





-7i 





a 7 + p ) 




vK 


Wl 2 


-|^s| 2 + 


l^sl 2 


-Kl 2 )( 


t + 


-a 2 ) 









The FFT is based on the factorization of the Fourier matrix 

F n = n- 1 / 2 A t ...A 1 P n (n = 2*), 

where P n is the bit-reversal permutation matrix and each Aj has exactly two 
non-zero entries in each row, one of them = 1. In our special case n = 2 3 , 






D 4 



diag 






D 2 
-D 2 



with D4 := diag(l,w,ix' 2 ,a; 3 ), 

with D2 := diag(l, — i) 



D 2 
-D 2 



and A\ is block-diagonal with four blocks 



1 
-1 



4.4 Example. In the computation of the FFT X = 8~ 1 / 2 A 3 A 2 A 1 P 8 X, one direct 
call, four complex multiplications and 24 additions are used, since only A3 contains 
nontrivial factors u and u> 3 = —\uj. As in the case of the naive algorithm we do not 
take into account the error from the final multiplication by 8 -1 ' 2 . 

The butterfly operations in the multiplication with A\ and A2 give each rise 
to independent roundoff errors A"J and A"^ of variance 2a 2 + p 2 u 2 resp. Aa 2 + p 2 u 2 in 
every component - no multiplications are necessary. Then X^ ' := A2A1PSX has 
uncorrelated components of variance 4p 2 . 



Let A<°> := 



U D' A 

Ia -D' a 



withi?4 :=diag(l,0,-i,0),and A (1) := E ly5 -iE 3J - 



1(0) 



1(1) 



E 5:5 + iE 7 j. Since 

A 3 = A w + ujA ( 

the roundoff error A3 from the multiplication A 3 X^ ' has the covariance matrix 
Cov (A 3 ) = 8p 2 a 2 + I 8 u 2 + 4 / 9 2 (2cr 2 - + a 2 x )Ku 2 by Proposition 3.8, where K and o 2 - 
have the same meaning as in Example 4.1. 

Using the fact that 2 _1 ' 2 Aj is an orthogonal matrix (j = 1,2,3), the total 
roundoff error A = -h= (A3 + A3A2 + A3 A2A1) has the covariance matrix 

Cov (A) = ^(Cov (A 3 ) + A3C0V (A 2 )A* + A3A2C0V (Ai) A* 2 A* 3 ) 



3a 2 ^ I 8 u + p 2 (aj- + -a 2 x )Ku 2 , 
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which is the same matrix as (4.2) with j3 = qj- + 2 c 2 x an< ^ a = ^ a €+- Again, 



EfllAII 



a + -P) E{\\X\\i) 



with the same a and (3 as in the naive algorithm with cascade summation. 

4.5 Example. With the Gentleman-Sande algorithm, the FFT is computed via 

X = S-^PsAjA^AlX. Since A T 3 = A™ + W A« with A™ :-- 



n" 



D'i ' 



D'l := diag(0, 1,0, -i) and A (1) := -E ly5 + \E 3J + E 5A - iE 7>3 , it follows from 
Proposition 3.8 that the roundoff error A3 from the product A\X has the covariance 
matrix Cov (A3) = 2p 2 a 2 + I 8 u 2 + p 2 {2a 2 - + a 2 x )K'u 2 with K 1 := A (1) A (1) * = 
2£J5 j 5+2£^7j. Since later no proper multiplications occur, and since the components 
of A\X and A 2 A 3 X have variance 2p 2 resp. 4p 2 , the roundoff errors A2 and 
Ai from the product with A\ resp. A\ have covariance matrices Cov (A2) = 
4p 2 a 2 + I s u 2 resp. Cov(Ai) = 8p 2 a 2 + I s u 2 . 

In total, A := -4=_Pg(Ai + A*A2 + A\A 2 As) has the covariance matrix 

Cov (A) = diag(a, a, a, a, a, a + 2/3, a, a + 2f3)u 2 

with a = 3ct c 2 + and /3 = q^- + \<J 2 X - We have used the identities A 2 K' = K' A 2 , 
A\K = K A\ and PK = K P. As in the example above, we obtain 

E(||A||i) 



a+-f3)E(\\X\\i)u<. 

We will now study the discrete cosine transformations (DCT) of type II and III. 
The DCT-II x := Clx (x £ R n ) uses the cosine-II matrix 



Ct 



-^(r lj cos(j(2k + l)^- 

n\\ 2n 



n-l 



with r]j 



1 for j = 0, 

' j,k=o I \[2 for j = 1, . . . ,n — 1. 

We will again treat the case n = 8 only, which is of special interest in the JPEG 
algorithm of image compression. 

4.6 Example. In the naive approach of the matrix-vector product, 64 multiplica- 
tions and 56 summations are used. By examining the entries of C 8 we see that 



Cl 






U) 



with ci = |7i 

Cg 2 COS -jg , 

C (3 
C (4 

C (5 
C (6 

c< 7 



and C7 



I cos I , c 2 = I cos I , c 3 = \ cos ^ , c 4 

2 COS 16 



2 COS ^g , C5 



2 COS 16 ' 



cos Ts- occuring at 



E. 



4,fc, 



^fc=0 °' ^fc=0,3,4,7 4 ' fc ^fc=l,2,5,6 

-£■2,0 — -^2,3 — E 2t 4 + -E-2,7 — -^6,1 + -^6,2 + -^6,5 — ^6,6 j 

E 2 ,\ — -^2,2 — E 2 £ + -E?2,6 + -^6,0 ~~ -^6,3 ~~ -^6,4 + ^6,7j 

Eifi — Ei j — E 3j2 + E 3>5 — E 5t i + E 5 $ — E 7>3 + £7,4, 

-£■1,1 — -El,6 + -^3,0 — -^3,7 + -^5,3 — -^5,4 + E 7>2 ~ -^7,5; 
£■1,2 — -El, 5 — -^3,3 + -^3,4 + ^5,0 — E 5 j — E 7> i + £7,6, 

-Ei, 3 — -Ei,4 — E 3j i + £3^ + E 5>2 — £ 5j5 + £7^ — E 7 j . 
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Since Cj G ]\,\[ f° r 3 = 1)2,4,5,6, cs G If it anc ^ c 7 G li6'|[' an< ^ assumm g 
that the exact values are uniformly distributed between the two nearest machine 
numbers, we have E(|<5f | 2 ) = |[ for j = 1,2,4,5,6, E(\5%\ 2 ) = f^ and E(|5f| 2 ) = 

=|g. As in the case of 5^ in Remark 4.2, these values should be replaced by the 
squares of the exact roundoff errors if they are known. By simple calculations 
we obtain C (1) C (1)T = 8£ ,o + 8£ 4 , 4 , C (2) C (2)T = C (3) C (3)T = AE 2 , 2 + 4# 6 ,6 
and C U) C U)T = 2E 1;1 + 2.E 3 , 3 + 2E 5>5 + 2E 7J for j = 4,5,6,7. Using 8c 2 = 1, 
4(c 2 + c 2 ) = 1 and 2(c 2 + c 2 + Cg + c 2 ) = 1, it follows from Proposition 3.8 that the 
roundoff error vector A has covariance 

o u 

Cov (A) = (3a m 2 + + a m 2 x )pV J 8 + diag(64, 49, 40, 49, 64, 49, 40, 49)^, 

if cascade summation is used, since £^\- = | Y2k=o Wj cos2 (J 7r 2k w 1 )P 2u2(j2 ,+ = 



3,3 8 A^/c=U -/j 



3p 2 a 2 ,u 2 . In the norm we have 



E(||A|| 2 ) = (3a m 2 + + a K 2 x + ^)E(||X|| 2 )n 2 . 

4.7 Example. Following Loefner et al. [9], for n = 8 the fast DCT-II is based on 
the formula 

cl = -±=plWl®lWlW 

8 v/8 
-,(i) ._ / J 4 Ji\ r (2) ._ 



with ZA - 1 := I r )) -^ := diag(/4, i?4) (where Ra is the product of the 

\ J a —1a J 

Givens rotations G(2,3, f^) and G(l,4, f§), see [7], p. 372), L (3) is block diagonal 

consisting of four butterfly matrices I I , L^ .= diag(i?4, B4) (where -R 4 

is block diagonal with a butterfly matrix and v2G(3,4, 3 ^ L ), and S 4 consists of a 
butterfly in the first and last components and V2I2 in the middle components) and 
Pis the permutation matrix with Pa = (ao, a 7 , 0,2, 05, ai, fl6) a 3) (iaY • It uses 14 
multiplications and 26 additions (not counting the final multiplication by 8 -1 ' 2 ). 
If the rotations are replaced by lifting steps (see Remark 3.10), it takes 11 multipli- 
cations and 29 additions. The roundoff errors for both versions are approximately 
of the same magnitude. In the following we study the first method: 

In the first step no multiplications occur and therefore the computation of X^ ' = 
L^ ' X produces a roundoff error A 1 - 1 ' with covariance matrix _RT' ' = 2p 2 a 2 + u 2 Is- 
.X"' ' has uncorrelated component of variance 2p 2 . It follows from Remark 3.9 that 
the covariance matrix of the error A^ 2 ' from the computation of JO ' = 2/ ' X^ ' 
is K {2) =2/? 2 diag(0,0,0,0,/c 3 ,/ci,A;i,A;3)u 2 with 

U. ._ 2 ,2 , J_ F /'|xco S (j 7 r/16)|2 , ,rain(j7r/16)|2\ _ f °"m,+ + c V,x+ I92 3 = 1 ' 
« I CV.+ + <\,x + 6 ■? ~~ 6 - 

For the precomputation errors we have used that | < sin fg < | and | < 
sin || , cos || , cos ^ < 1. 
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From the third step X^ ' = L^ 'X^ ' we obtain a roundoff error A^ 3 - 1 of covari- 
ance matrix K^ ' = 4p 2 a^ + u 2 Ig. Finally, since each component X- ' has variance 
4p 2 , in the last non trivial product X^ ' = L' ' X^ ' a roundoff error A^ 4 ' with 
covariance matrix 10- ' = 4p 2 diag(2a 2 + ,2a 2 ^,ko,ko,2a 2 t+ ,k2,k2,2a 2 + )u 2 arises, 
where k = 2o m 2 + +2a m 2 >< +n- 2 E(|<5^ cos ( 3 "/ 8 )| 2 + |^ sin ( 3w / 8 )| 2 ) = 2o£ + .+2o£ x +5/12 
(since \ < v^2cos ^ < 1 < v^sin ^ < 2) and k 2 = 2a 2 x + u- 2 E(|<$^| 2 ) = 
2a K 2 x + l/3. 

We will now study the covariance matrix of the total roundoff error 

A := 8" 1 / 2 p(a( 4 ) + L^A^ + L^L^A™ + L^L^L^A^A. 
A simple calculation shows that 8~ 1 ' 2 PL^ ' Lr 'A' 2 - 1 has the covariance matrix 

1 "r.(4)r(3)^(2)r(3)T r (4)T p T_ f h+h A : aa(n 10 10 10 Um * 3 ~ h \ * 2 



§ pn*>n*>K^>K*> L™ pi = ^^^ dia g ( o,i j0 ,l,0,l,0,l)+^-^M 8 jpV 

with M 8 := E lt3 + £3,1 + E lj5 + E 5A + £3,7 + £7,3 - E 5J - E 7>5 . Since K m and 
K} ' are multiples of 1$, they pose no problem. Hence the covariance matrix of A 
has the diagonal entries 



r (A, 



( 3al + p 2 u 2 J' = 0,4, 

(4a m 2 + + a m 2 x + J9A 2 J = 1,7, 
(^l + + a 2 x +^)p 2 u 2 j = 2,6, 



I {3a 2 + + 2a 2 x + |H )p 2 u 2 j = 3, 5, 
plus the following non-zero entries outside the diagonal: 



Cov (A) j:k = 

y ~ 256 



^§u 2 p 2 (j, k) = (1, 3), (1, 5), (3, 1), (3, 7), (5, 1), (7, 3), 



^W (j,*) = (5,7),(7,5). 



In the norm, this sums up to 

7 
E(||A|| 2 ) = ^V(A J ) = (^ 0ffi 2 + +o- m 2 x +^)E(||X|| 2 )u 2 , 
i=o 

which is slightly larger than for the naive Algorithm 4.6. Other algorithms for the 
computation of the DCT-II in the case n = 8 have been described by Chen et al. 
[5] and Schreiber (see [12]). Using a larger number of multiplications they both 
satisfy 

E(||A|| 2 ) = (^ 2 ++ ^ x + i|)E(||X|| 2 )n 2 

which is slightly larger than the roundoff error of the algorithm by Loeffler et al. 

The DCT-III X := C^x {x G R n ) uses the transposed matrix Cf := (C%f '. By 
orthogonality of C 8 it is the inverse of the cosine-II matrix. 
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4.8 Example. In order to study the naive algorithm of matrix-vector multi- 
plication in the case n = 8, we now apply the decomposition C 8 = (C n ) = 
^2j = iCjCr 3 ' , using the notations from Example 4.6. The covariance matrix of 
the roundoff error is no longer diagonal, since C^ ' C ^ ' = 2I 8 + 2J 8 + 2K 8 with 

K 8 := ( J T 4 * 4 Y C^C^ = d&d® = I 8 + J 8 -K 8 and C^C^ = I 8 -J 8 

\ li J a ) 

for j = 4, 5, 6, 7. By similar arguments we obtain (for cascade summation) that 

101 \ 2 2 T 1 2 2 T , 1 

768/ F 256 P 64' 



Cov (A) 



3cr„ 



+ Cv + 7^777 )P Z U Z I 8 + ——p 2 U 2 Js + ^-p 2 U 2 K 8 , 



where the fractional constants follow from the identities E(2|<5f | 2 + ^ 7=2 \^f\" 



101, ,2 



768 



64 



u 



E(2|«5f| 2 + |^| 2 + |^| 2 -£ =4 |<5ff 



256 



andE(2|5f| 2 -|<5f| 2 -|^|^ 



The norm is the same as for the naive DCT-II in Example 4.6. 



4.9 Example. The fast DCT-III for n = 8 proposed by Loefner et al. [9] is based 
on the factorization Cf = 8 -1 / 2 L (1) L (2)I L (3) L (4)I P t with the notations from 
Example 4.7. The covariance matrices K^ 3 ' (j = 1,2,3,4) of the roundoff error 

vectors A^ 3 ' resulting from the multiplication by L^ 3 ' remain essentially the same; 
only the leading factors 2p 2 ,2p 2 ,4p 2 ,4p 2 in Example 4.7 have to be replaced by 
8p 2 , 4p 2 , 4p 2 , p 2 due to the different variance of the intermediate results. A simple 
calculation yields 



L W K ( 2 ) L W l 



^(1)^(2)^(3)^(4)^(3)^(2)^(1) 



2(fci + k 3 )I 8 

2a 2 + (2I 8 + R 8 +^Q 8 



2(fci + k 3 )J 8 + 2(fc 3 

V2. 



ki)Nt 







+ k Q (I 8 


+ Js 




R 8 ) + * 


; 2 (/ 8 


-J{ 


-¥*)] 


where N 8 : = diag(l, — 1, — 1, 


1,1, 


-1,-1 


,1)(J 


i — 


J 8 ), R 


8 : = 


(Ja 
U 


) , and 

Ja J 


I ° 


1 


1 








-1 


-1 


\ 






1 








1 


-1 








-1 






1 








-1 


1 








-1 




Q 8 ■= 






1 
-1 


-1 
1 










1 
-1 


-1 
1 










-1 








1 


-1 








1 






-1 








-1 


1 








1 




\ o 


-1 


-1 








1 


1 


) 




Therefore the total err 


or 



















8 -i/2 f A (D + L (D A (2) + L^L™ A™ + L^L™ L<- 3 W> 



has the covariance matrix 



Cov (A) 



13 



-CC-L + CC V + ^7^7 ) Is 



+ iV 8 

256 



121 \ 
768/ 



rf» 



r 2 

r) "K,X 



o- m 2 ++ -)Q g 



— 1 

768/ 

1 2 5 
4^x + 96 )jR8 



2 2 
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which leads to the same norm estimate as for the fast DCT-II algorithm. Again, the 
fractional constants above are the estimates for the precomputation errors. If these 
are all known exactly, the precision of our result can be improved. For example 
the constant ^ku 2 occuring in the norm estimate can be replaced by the quantity 



768 
l(| 5 V2|2 + | (5 V2sin^|2 + | 5 V2cos^|2 +2 | (5S ir 

if we know the exact sizes of these errors. 



l +2\5 c 



l +2\5 s 



l +2\5 c 



2 )- 



4.10 Remark. In order to check the accuracy of our results by a simulation, we 
perform a discrete Fourier (or cosine) transformation followed by its inverse and 
compare the result with the original data. If K is the covariance matrix of the 
roundoff vector of the first unitary transformation, X i— ► AX and K that of 
the second, X i— > A*X and if the roundoff errors of the two transformations are 
independent, then the observed roundoff error H(A* fl(AX)) — X has the covariance 
matrix K = K' + A*KA. 

For the fast DCT-II (see Example 4.7) followed by the fast DCT-III (see Ex- 
ample 4.9), we have K = 2K . In the following diagrams of Figure 4 we compare 
this estimation with a simulation with 100000 samples in MATLAB. We have not 
used the built-in matrix-vector multiplication, which in MATLAB is implemented 
differently from the standard algorithm. 
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Figure 4: Computed (right) and simulated (left) covariance matrix for the roundoff errors of the 
fast DCT-II, followed by the fast DCT-III. The areas are proportional to the absolute values of 
the covariance and negative covariances are depicted by a darker shading. 

The minor difference between the two covariances (particularly outside the diago- 
nal) stem from two effects: firstly, since the precomputed constants are the same for 
every simulation, we observe the covariance conditioned on these values. Secondly 
the same constants appear in both the DCT-II and the DCT-III. This depencency 
increases their contribution to the covariance matrix K by up to a factor 2. 
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Abstract 

In this paper a general theory of semi-classical matrix orthogonal polynomials is 
developed. We define the semi-classical linear functionals by means of a distributional 
equation D{uA) = uB, where A and B are matrix polynomials. Several characteriza- 
tions for these semi-classical functionals are given in terms of the corresponding (left) 
orthogonal matrix polynomial sequence. They involve a quasi-orthogonality property 
for their derivatives, a structure relation and a second order differo-differential equation. 
Finally we illustrate the preceding results with some non-trivial examples. 
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1 Introduction 



The study of semi-classical orthogonal polynomials in the scalar case (i.e., 
the study of orthogonal polynomials whose associated functional satisfies a dis- 
tributional equation D(u<&) = -u\I/, where $, \1/ are polynomials with deg\l/ > 1) 
was started by Shohat ([14]) in order to generalize the properties of the classical 
orthogonal polynomials ([3]). 

Among others, [2, 9] present an approach to such polynomials taking into 
account the quasi-orthogonality of the derivatives of the polynomial sequence. We 
also mention the results of Maroni who, in [11, 12], develops an algebraic theory 
of semi-classical orthogonal polynomials. The purpose of this theory is the char- 
acterization of the semi-classical orthogonal polynomials by means of the quasi- 
orthogonality of their derivatives, a structure relation and a differo-differential 
equation of second order (see also [1]). 
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In the last years, the study of matrix orthogonal polynomials has attracted 
the interest of the researchers (see [4, 6, 8, 10]). As for their differential properties, 
it is known the result of Duran ([5]) who characterizes those matrix orthogonal 
polynomials satisfying a symmetric second order differential equation with poly- 
nomial coefficients. He proves that, in the positive definite case, if the differential 
equation has its coefficients on the same side, the matrix orthogonal polynomials 
are necessarily diagonal (up to factors) with classical scalar orthogonal polynomi- 
als in the diagonal. In spite of the variety of applications of matrix polynomials ( [4, 
5, 6]), there are not too many known families of semi-classical matrix orthogonal 
polynomials out of the diagonal case. 

One way to study many families of matrix orthogonal polynomials is to ex- 
tend the analysis of differential properties started by Duran. A natural way to do 
this is to generalize the theory of semi-classical scalar orthogonal polynomials to 
the matricial case. In order to do that, we start defining the semi-classical matrix 
orthogonal polynomials by a Pearson-type equation for the corresponding orthog- 
onality matrix functional. We prove that such matrix orthogonal polynomials are 
characterized by a (in general, non-symmetric) differo-differential equation. In the 
way to the proof, we find another equivalences that generalize the known ones in 
the scalar case. 

This paper has been organized as follows. In Section 3 we introduce and study 
the concept of quasi-orthogonality for matrix polynomials. 

The main result of Section 4 is the characterization of the semi-classical func- 
tionals in terms of the quasi-orthogonality of the corresponding matrix orthogonal 
polynomials and their derivatives. 

In Section 5 we obtain for the semi-classical matrix orthogonal polynomials a 
structure relation and a differo-differential equation, showing that they are equiv- 
alent to a Pearson-type equation for the related orthogonality matrix functional. 

Finally, in Section 6, we illustrate the preceding results with some examples, 
showing that there exist non-diagonalizable semi-classical matrix functionals. 



2 Basic tools 

Denoting by C*- m ' m - ) the set of m x m complex matrices, in what follows p( m ) 
will be the C (m > m ) -left-module 




a k GC (m ' m \ neN 
and P< m )' the C( m ' m )-right-module Hom(p( m \ C^ m ' m )) . 
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2.1 Definition 

(i) The duality bracket is defined by 

/. A • p( m ) x p( m )' __3. c( m ' m ) 

(P,u)-(P,u):=u(P) 

(h) Given -u G p( m )', the functional ux k I G P (m) ' (ft = 0, 1, . . .) is defined by 

(P,ux k l) :=(x k P,u), 
where I denotes the m x m identity matrix. A linear extension gives the 

n 

right-product uA G p( m )' for any A G p( m ), with A{x) = \2 a k^ k , in the 

fc=0 

following way: 

n 

(P,uA)=J2(x k P^)o: k . 

k=0 

(Hi) Every u G p( m ) , defines a matrix inner product in p( m ) by 

/. .) ■ p( m ) x p(™) _^ (£(rn,m) 

{P,Q)^(P,Q) u :=(P,uQ*) 
where Q* denotes the transposed conjugated of Q. 

Remark The duality bracket verifies the property of linearity 
(oi-Pi + a 2 P 2 , filUi + f3 2 u 2 ) = 

= a 1 (P 1 ,u 1 )/3 1 + a 1 (P 1 ,u 2 )/3 2 + a 2 (P 2 ,u 1 )/3 1 + a 2 (P 2 ,u 2 )(3 2 , 

for all ai,a 2 ,/?i,/? 2 e C( m ' m \ P U P 2 G P (m) and u u u 2 G P (m) ', while the inner 
product is sesquilinear, that is, 

(ojiPi + a 2 P 2 , f3 1 Q 1 + P 2 Q2) U = 

= a 1 (P 1 ,Q 1 )j* 1 +a 1 (P 1 ,Q 2 )j* 2 +a 2 (P 2 ,Q 1 )jt + a 2 (P 2 ,Q 2 )j* 2 , 
for all aua^PuPt eC( m ' m ) and Pi,P 2 ,Qi,Q 2 ePW. 

2.2 Definition We denote by G& := (x fc J, tt) tie k-th moment with respect 
to u G p( m ) . Given u G p( m ) with moments (Ck)^ =0 , we say that u is quasi- 
definite (non-singular) ifdet(Ck+j) k _ 7^ 0, Vn > 0, where (G/e+j)? _ is the 

Hankel-block matrix 

/Cq C\ ... C n \ 

Gi G 2 ... G n+ i 
\G n G n +i ... G 2n / 
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Remark Given a sequence (C^), C C < - m ' m - ) , there exists a unique u G p( m ) 
such that (x k I,u) = Ck- This establishes an isomorphism between p( m ) and the 



oc 

formal series N^Cfc^ with coefficients in c( m ' m ). 

fc=0 



2.3 Definition We say that u G P (m) ' is hermitian if CI = C k for all k>0. 
The following result is very well known (see [4, 6, 7, 8, 13]). 

2.4 Theorem u G p( m ) is quasi-definite if and only if there exists a unique (up 
to non-singular left matrix factors) sequence of left orthogonal matrix polynomials 
(P n ) n> with respect to u, that is: 

(1) P n GP( m \degP n = n. 

(2) The leading coefficient of P n is non-singular. 

(3) (P n , Pfc) = K n 5 n k for k < n, where K n is non-singular. 
This sequence verifies a (left) recurrence relation 

xP n (x) = a n P n+1 (x) + (3 n P n (x) +7 n P n _i(x), n > 0, 

p-i(x) = e, 

where a n , f3 n , 7 n G C ( - rn,m \ with a n , 7 n non-singular, and 9 denotes the zero 
m x m matrix. 

Remarks 

(i) The inner product (P n ,x n /) = (x n P n ,u) is non-singular for every quasi- 
definite functional u G p( m ) . 

(ii) If we consider the structure of right- module on p( m ) (so, left-module on p( m ) ), 
we can define in a similar way a sequence (II n ) n >o of right orthogonal ma- 
trix polynomials with respect to a quasi-definite matrix functional u. The 
difference is that the condition (3) of Theorem 2.4 must be substituted by 
(il£,n*) = L n 5 n k for k < n, with L n non-singular. The sequence (n n ) n > 
satisfies a right recurrence relation. 
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3 Quasi-orthogonality 

As in the scalar case, the semi-classical character will be closely related to the 
idea of quasi-orthogonality. But the scalar quasi-orthogonality, when generalized 
to the matricial case, splits in two different concepts. Both of them will play an 
important role in the characterization of the semi-classical matrix functionals. 

3.1 Definition Let (Q n ) >0 be a sequence of matrix polynomials such that the 
leading coefficient of Q n is non-singular and degQ n = n. 

1. We will say that (Qn) n>0 is a sequence of left quasi-orthogonal matrix poly- 
nomials of order r with respect to v E p( m ) if 

(a) (x k Q n ,v) — 9, 0<k<n — r — l, n>r + l, 

(bl) There exists no > r such that (x n °~ r Q no , v) ^ 9. 

2. We will say that {Qn) n>0 is a sequence of regularly left quasi-orthogonal 
matrix polynomials of order r with respect to v E p( m ) if 

(a) (x k Q n ,v) = 0, 0<k<n — r — l, n>r + l, 

(b2) There exists no > r such that (x n °~ r Q no ,v) is non-singular. 

Notice that if r = the previous definition is the standard orthogonality. 
Furthermore, for m = 1 both definitions yield the scalar quasi-orthogonality. 

As we will see in the next proposition, when a sequence of quasi-orthogonal 
matrix polynomials is already orthogonal with respect to another functional, the 
conditions (bl), (b2) become stronger. 

3.2 Proposition Let u E p( m ) be quasi-definite and let (P n ) >n be the corre- 
sponding sequence of left orthogonal matrix polynomials. Given v E p( m ) , the 
sequence {Pn) n>0 is quasi-orthogonal of order r with respect to v if and only if 

(a) (x k P n ,v) — 9, 0<k<n — r — l, n>r + l, 

(bl ') (x n - r P n , v) ^ 0, Vn > r. 

The sequence (Pn) n>0 is regularly quasi-orthogonal of order r with respect to v if 
and only if 

(a) (x k P n ,v) = 6, 0<k<n-r — l, n>r + l, 
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(b2 J ) (x n ~ r P n ,v^j is non-singular \/n>r. 

Proof Let n > r verifying (bl) or (b2). Taking into account the recurrence 
relation (1.1), we obtain 

7n+1 {x n ~ r P n ,v) = (x n+1 - r P n+1 ,v) 

and (bl) or (b2) holds for n + 1. In a similar way, if n > r + 1, 

^(x^-rp^v) = (X^P^V) 

and (bl) or (b2) is satisfied for n — 1. □ 

Remember that, given w G p( m ) , we say that w — if (P, u;) = 6>, VP G p( m ). 
In what follows, the following result will be useful. 

3.3 Lemma Let u G p( m ) be quasi-definite and let (P n ) n > ^ e ^ e corresponding 
sequence of left orthogonal matrix polynomials. Then, given w G p( m ) , w = if 
and only if there exists no such that 

(x k P n , w) = 6, < k < n, n > n . 

Proof Assume the above orthogonality conditions. From the recurrence (1.1), 

(x k P no - 1 ,w) =7^ ((x k+1 P no ,w) -a no (x k P no+1 ,w) -/3 no (x k P no ,w)) =6 

for < k < no — 1. So, the hypothesis is true for n > no — 1, < k < n, too, and, 
by induction, for < k < n. Therefore, (P n , w) = 9 for all n > and, thus, w = 0. 

□ 

Given a sequence of orthogonal matrix polynomials with respect to a func- 
tional u, its quasi-orthogonality with respect to another functional v can be char- 
acterized by a simple relation between u and v. 

3.4 Theorem Let u 7 v G p( m ) such that u is quasi-definite and let (P n ) n >o ^ e 
the corresponding sequence of left orthogonal matrix polynomials. Then, 

(i) (P n ) n >o is quasi-orthogonal of order r with respect to v if and only if there 
exists A G p( m ), with degA = r, such that v = uA. 

(ii) (P n ) n > is regularly quasi-orthogonal of order r with respect to v if and only 
if there exists A G p( m ) , with deg A = r and non-singular leading coefficient, 
such that v = uA. 
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Moreover, in any case, the matrix polynomial A is unique. 

Proof We will do the proof just for (i) because for (ii) the arguments are analogous. 

r 

(<=) Consider A(x) = \^ a j x K with a r ^ 9. Then, if v = uA, 

3=0 



1 x k 



Pmv) = Y,{x k+J Pmu)a 3 

J=0 



Hence, for < k < n — r — 1, we have 

(x k+ JP n ,u) = 6, 0<j<r, 

and, for k — n — r, 

(x k P n ,v) = (x n P n ,u)a r ^e. 

So, (Pn) n >o is quasi-orthogonal of order r with respect to v. 

r 

(->) We will prove that = v — y^x-'uaj has a unique solution (aj) r j =0 . 

3=0 

For all n > r, the system of r + 1 equations 






"P n ,v) = (x n P n ,u)a ( ?\ 



(2.1) 



k (x n P n , v) = (x n P n , u) a ( n) +■■■ + (x n+r P n , u) a 



( n ) j u/^+r p „A„(») 



has a unique solution in the unknowns (a: )J =0 . Notice that the first equation 
leads to af" ^ 0. 

We will prove that (a n )J=o are independent of n. From (1.1), the relation 

7n+ i (x k P n ,v) +/3 n+1 (x k P n+u v)+a n+1 (x k P n+2 ,v) = (x k+1 P n+1 ,v) (2.2) 
is verified for all k > and for all -0 G p( m ) . The first equation of (2.1) gives 

(x n - r P n ,v) = {x n P n ,u)a^ 



x 



" +x - r P n+1 ,v) = (x n+1 P n+1 ,u)ai n+ V 



(2.3) 



and, from (2.2), 

" 7n+1 (x n - r p n , v) - (x n+1 - r p n+1 , v) = e, 
7n+1 (x n p n ,u) - (x n+1 p n+1 ,u) = e. 



320 



M.CANTERO ET AL 



Equations (2.3) and (2.4) imply that a™ = a™ . 

Suppose dj = dj for j — k + 1, . . . , r, with k < r, and for all n > r. We 
will prove that a^. = a£ too. Taking into account (2.1) we have 



,x 



n — k 



P n ,v)=J2(x n ~ k+Jp n,u)a { j 



(n) 



j—k 



(2.5) 



,.!• 



n - k+1 P n+1 , v)=J2 (x n+1 - k+J P n+1 ,u) a) 

j=k 



(n+1) 



and, from (2.2), 

-f n+1 (x n - k P n7 v) +/3 n+1 (x n - k P n+u v) + a n+1 (x n - k P n+2 ,v) 

n-k+l 






Pn+l,v) 



ln+1 (x n - k +iP n ,u)+[3 n+1 (x n - k +3P n+1 ,u) + a n+1 (x n - k+ JP n+2 ,u) = 

= (x n+1 - k+ JP n+1 ,u). 

Substituting (2.5) into (2.6) we obtain 
7n+1 ^<^- fc+ ^ n , W )af ) +/3 n+1 j^ (x n - k+ ip n+1 ,u)a<f +1) + 



(2.6) 



(2.7) 



j=k 



j=k+l 



+ a n+1 J2 {x n - k+3 Pn + 2,u)af +2) = Y J {^ +1 ~ k+3 P n+ i^)a J 



(n+l) 



j=k+2 



j = k 



From this equality, taking into account (2.7) and applying the induction hypoth- 
esis, we get 



7 n+ i (x n P n , u) 4 nJ - (x n+1 P n+1 , u) 4' 
Thus, by (2.4), we conclude that a). = a k • 



,(n+l) 



e. 



Let us denote by (a_j)J_ the solutions of (2.1) for n>r, which we have proved 

r 

that are independent of n. If A{x) = \^ a j x ^ and w = v — uA, 

3=0 






k P n ,w) = (x k P n ,v)-J2( xk+Jp mu)a j , 0<k<n, n > r. 

3=0 



When < k < n — r — 1, the orthogonality with respect to u and the quasi- 
ortogonality with respect to v give (x k P n ,w^ = 9. Furthermore, {x P n ,w) = 9 
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for n — r < k < n too, because (aj)J =0 are the solutions of the equations (2.1). 
So, w = 0, according to Proposition 2.2. 

To prove the uniqueness of the matrix polynomial A, let us suppose A,Be 
p(m) suc j 1 that v = UJ \ = u j$ Then, from (i), r = degA = degB since (P n ) n >o 
can not be quasi-orthogonal of different orders with respect to v. Hence, A(x) = 

r r 

/ a,jx\ B(x) = y^ bjX-i with a r , b r ^ 9. From 

r r 

(x k P n , V) = Y, (x k+J Pm U) a 3 = Y, (x k+J Pn, u) b, , 

j=o j=o 

and taking k — n — r, . . . ,n, we find 

(x n P n ,u}(a r -b r ) = e, 






n+r P n , u) (a r -b r ) + --- + (x n P n , u) (a - b ) = 0. 



Therefore, a, = bj for j = 0, • • • , r. □ 



4 Semi-classical functionals 

We consider the derivative operator on the space p( m ) as the linear operator 
D : P(™)' -»■ P( m )' such that (P,Du) = - (P',u) . From this and the definition of 
the right-product it is straightforward to prove that D{uA) = (Du)A + uA' , for 
all «6PW and A eP (ra '. 

4.1 Definition Let u G p( m <> be quasi-definite. We will say that u is semi- 
classical if there exist A, B G p( m ) ? with det A ^ 0, such that it is verified the 
distributional equation D (uA) = uB. We will also say that the corresponding 
sequence of left orthogonal matrix polynomials (P n ) n >o IS semi-classical. 

Remarks 

(i) Given u G p( m ) ; the linear functionals Uij : p( m ) — *■ C (i,j = l,---,m) 



defined by Uij(P) = u{P)ij, VP G p( m ), are called the components of u. 

m 

Then, (uA)ij = y^UikA^j with the obvious definition for the multiplication 

fc=i 
of Uij by a scalar polynomial. Therefore, det A ^ is the minimal requirement 
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to ensure that the previous definition will involve to all the components of the 
functional u. 

(ii) Notice that the matrix polynomial B of the Pearson-type equation of a semi- 
classical matrix functional can not be null. Otherwise, the Pearson-type equa- 
tion implies D(uA) = and, then, det(Ck+j) p k 7=0 = 0, where p = deg A. This 
contradicts the non-singularity of u. 

4.2 Lemma Let u G P (m) ' such that D (uA) = uB. Then, for every C G P (m) , 

D (uAC) = u {AC + BC) 

Proof It is just a consequence of the rule for the derivation of the right-product. 

□ 

The previous result implies that, for every u G P ( - m - ) , the set 
A u = {Ae P (m) | D(uA) = uB,Be P (m) } 
is a right-ideal of p( m ). 

When dealing with scalar semi-classical functionals we arrive in this way to 
an ideal of P := PW, which is necessarily generated by a unique (up to non-trivial 
factors) polynomial. This generator is used to classify the semi-classical scalar 
functionals (see [11, 12]). 

Since in the matricial case, a right-ideal of p( m ) is not necessarily principal, 
we cannot use A u for the classification of semi-classical functionals. However, if 
u G p( m ) is semi-classical, then there exists a G P \ {0} such that al G A u . 
To see this just notice that if A G A u with det A ^ 0, then (det A) I G A u since 
(detA)I = A ad] A. Therefore, for every semi-classical matrix functional u G p( m ) , 
the set 

■& u = {a G P | D(ual) =uB,B G P (m) } 

is a non-trivial ideal of P. We can use the "essentially" unique generator of this 
ideal to classify the semi-classical matrix functionals similarly to the scalar case. 

4.3 Definition Let u G p( m ) be semi-classical and let a G P\{0} be a polynomial 
with smallest degree such that D(ual) = uB, B G p( m ). Then, we say that u is 
of class s = max{p — 2, q — 1}, where p = deg a and q = deg B. 

Remarks 

(i) The preceding discussion shows that this definition is well done: it always 
exists such a polynomial a and the definition of class does not depend on the 
choice of a. 
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(ii) When u £ p( m ) is semi-classical there exists A £ A u with det A ^ 0, but 
it is possible for the leading coefficient of A to be singular. However, there 
always exists A £ A u with non-singular leading coefficient. To see this just 
take A= (det A) I. 

The following theorem gives the first characterization of the semi-classical 
matrix functionals. 

4.4 Theorem Let u £ p( m ) be quasi-definite and let (P n ) n >o be the correspond- 
ing sequence of left orthogonal matrix polynomials. Then, the following statements 
are equivalent: 

(i) u is semi-classical. 

(ii) There exists v £ p( m ) such that (P^ +1 ) n>Q is quasi-orthogonal with respect 
to v, and (P n ) n > is regularly quasi-orthogonal with respect to v. 

Proof 

(i) => (ii) If u is semi-classical, then D (ual) = uB, a £ P\ {0}, B £ p( m ), 
according to the previous comments. Therefore, 

(x k P n )' ,ual\ + (x k P n ,uB) = 0. 

q 
Let a(x) = y^ctjxi , B(x) = /^bjX^ with a p ^ 0, b q ^ 9. With this notation the 

J=0 j=0 

above equation becomes 

P Q 

J2 (kx k ~ 1+J P n , u) aj + J2 (x k+Jp n, u) bj = - (x k P' n , ual) . (3.1) 

J=0 j=0 

The left hand side of this expression is equal to the zero matrix if0</c<?i— p 
and < k < n — (q + 1). Let r = max{p — 1,^}. Then, (x k P' n ,uaI*) = if 
< k < n — r — 1. For k = n — r, (3.1) becomes 

(n - r) (x n P n , u) a r+1 + (x n P n , u)b r = - (x n ~ r P^ ual) , 

and thus 

(x n - r P' w ual) = - (x n P n , u) \(n - r)a r+1 + b r ] , 

which is equal to the zero matrix for at most one n. So, the sequence (Q n ) n >o > 
with Q n = Pn + i, is quasi-orthogonal of order r — 1 with respect to ual. 

Obviously, since al has non-singular leading coefficient, (-Pn) n >o is regularly 
quasi-orthogonal of order p with respect to ual. 
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(ii) =>- (i) If (Pn) n >o is regularly quasi-orthogonal of order p with respect to v, 
there exists A G p( m ), with deg A = p and non-singular leading coefficient, such 
that v = uA. Notice that if the leading coefficient of A is non-singular, then 
det A ^ 0. 

Let w = D(uA). (P, w) = — (P', uA) and for P = x k P n we have 
<x fc P n , w) = -k (x k - 1 P n , uA) - {x k P' n , uA) . 

From the quasi-orthogonality, the right hand side vanishes if < /c < n — p and 
< k < n — s — 2, where s is the order of quasi-orthogonality of (P^+i) n>0 - So, 
(P n ) n >o i s quasi-orthogonal with respect to the functional w with order at most 
maxjj) — 1, s + 1}. Thus, there exists P e P^" 1 -* such that degP < max{p — 1, s + 1} 
and w = uB. D 

Remark Notice that if (P^ +1 ) n>0 is quasi-orthogonal of order s with respect 

to ual, then D(ual) = uB where dega = p, degP = q, and this implies that 
s = maxjj) — 2, q — I}. 



5 Structure relation and differo-differential equation 

5.1 Theorem Structure relation 

Let u G p( m ) be quasi-definite and let (Pn) n > be the associated sequence of 
left orthogonal matrix polynomials. Then, the following statements are equivalent: 

(i) u is semi-classical. 

(ii) There exist a polynomial a G P\{0} and matrices Q n G C*- m ' m - ) (n > 0, — s < 
j < p), such that 

a{x)P' n+1 {x) = J2 ®fPn +j (x), 

j = ~s 

where p = deg a, s > m&x{p — 2, 0} and 0_ s 7^ 9 for some n > s (we use the 
convention P/- = 9 for k < 0). 

Proof 

(i) =^- (ii) If u is semi-classical, then there exist a G P \ {0} and B G p( m ) such 
that D(ual) = uB. Hence, (-Pn+i) n >n ^ s quasi-orthogonal with respect to ual of 
order s = max{p — 2, q — 1}, where p = deg a and q = deg B. This implies 



,.!•' 



,k -n' 



P^ +1 ,ual) = 9, 0<k<n — s — l, 



(x n s P' n+1 ,ual) 7^ 9, for some n > s. 
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Since aP^ +1 E P„+ p , there exist 6 > n verifying (ii). 

(ii) =>- (i) Let v = ual. Then, 

p 
{x k P' n+l ,v) = J2 @ t ] (x k Pn +J ,u) = 6, 0<k<n-s-l, n> 



3= — 8 

and, since (x n ~ s P n - s ,u) is non-singular 

„n—s r» „.\ C\( n ) 



X 



p; +1 , v ) = eL n ; (x n - s P n - s , u) # e, for some n > s. 



Thus, (P4+i) n > ^ s quasi-orthogonal of order s with respect to v. Obviously 
(P n ) n>0 is regularly quasi-orthogonal of order p with respect to v and, therefore, 
u is semi-classical. D 

Remark The proof shows that if u is semi-classical with a Pearson-type equation 
D(ual) = uB, then there is a structure relation with integers p = dega and 
s = max{p — 2, q — 1}, where q = deg £?. 

5.2 Theorem Differo — differential equation 

Let u E p( m ) be quasi-definite and let (P n ) n > be the corresponding se- 
quence of left orthogonal matrix polynomials. Then, the following statements are 
equivalent: 

(i) u is semi-classical. 

(ii) There exist two polynomials a, (3 E P, a 7^ 0, and matrices A^ £ C*- m ' m - > 
(ft > 0, — s < k < s) , such that 

s 

a(x)P^(x) + f3(x)P' n (x) = J2 ^ } Pn + k(x), 

k= — s 

where s > max{p — 2, g — 1}, p — deg a and q = deg (3 (we use the convention 
P k = 9 fork< 0). 

Proof 

(i) =>- (ii) Let u be semi-classical. By Theorem 5.1, there exist a polynomial 
a E P, with dega = p\ > 0, a non-negative integer si > pi — 2 and matrices 

e (n) e C ( m , m ) ^ > q^ _ Si < j < pi ) 7 suc h that 

a(x)P' n (x) = £ e^-^Pn-i+iCx). (5.1) 

j — -Sl 
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Taking derivatives in (5.1), we obtain 



Pi 



3— -si 



a 2 P'; + aa'P' n = £ Of^aP^ 



Using (5.1) in the right hand side of the above equality, 

pi pi 



a 2 



PZ + aa'P> n = J2 ©r i} E ®t 2+j) Pn-2 +J+ k, 

3——S1 k——S\ 



Let us denote a — a , f3 — aa\ p = 2pi = degct, q = 2pi — 1 = deg/3, s 
max{2 Pl -2,2 Sl + 2}andAi n) = ]T Q^ Q^-^ ■ Then > 



3—-S1 



<*k+pk = E A i n)p ^ 



s 

(n) 



J=-s 



with s > max{p — 2, g — 1}. 

(u) =>- (i) Without loss of generality, assume that (P n )n>o is the sequence of monic 
left orthogonal matrix polynomials with respect to u, that is, P n (x) = x n I + ■ ■ ■ . 
Denoting by (Il n ) n >o the sequence of monic right orthogonal matrix polynomials 
with respect to -U, the dual basis (7Tn) n >o °f (-Pn) n >o i s given by ir n = iiUnE^ 1 , 
where E n := (x n P n ,u). Let (Qn) n>0 be the basis of P( m ) given by Q n = P^+i, 
and let (p n )n>o be the corresponding dual basis. We have the relation 

Dp n = -TTn+l, U > 0. (5.2) 



Consider for every n > the linear functional — D(n n a) + n n [3. There exists 
( X^) C C( m ' m ) such that 



-D(n n a) + Tv n (3 = ^ p k \ 
with 



oo 

(n) 
k ' 
k=0 



[Qji-Dfr^ + irnP) = J2(Qj>Pk) X l n) = ^\ 3 > °- 



(n) _ x (n) 
k=0 

Hence, our hypothesis implies that 

» _ V^ . (j + 1) 



Ai n) = E A? +1) * i+1+M . (5.3) 



k= — s 
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>(0) 



For n — 0, (5.3) leads to A^ = if j > s, and 

s-l 



-D(ir a) + n (3 = ^ Pk^ 
or, equivalently, 



(0) 

k > 

fc=0 



-Dfaa)^ 1 + u/?^ 1 = ^p fc A^ 



s-l 

(0) 



If we denote A^. = X k Eq : then 



£;=0 



s-l 

;(o) 



D(ua)+u(3 = J2PkK°- (5-4) 



fc=0 



In a similar way, for n = 1, (5.3) gives 

s 

-D(uani)£?f x + uPU^ 1 = ^ p fc A 



(i) 
k > 

fc=0 



because A,) ; = for j > s + 1. Thus, 



-D(uani) + u/HIi = ^ p fc A^ , (5.5) 



fc=0 



with Aj^ = A^^i. 



fc=0 



Keeping in mind that P±(x) = xl + M, (5.5) can be written as 

s 

—D(ua)Hi — ual + uflHi = \^ PkX\ 
and, using (5.4), we get 

-uaI = ± Pk X^-[^PkXt ) )li 1 . 

fc=0 \fc=0 / 

Taking derivatives in the above equality and applying (5.2) we obtain 



D(ual) = ;>> +1 A^ - J> + iA<°> n, + J>A<°> = 

fc=0 \k=0 / fc=0 

s /s-l \ s-l 



fe=0 \fc=0 / fc=0 
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As a consequence, the polynomial \1/ G p( m ) given by 

* : = ^n^-^AW - [J2^+i E klA 0) ) ni, 

fc=0 \fc=0 / 

whose degree is not greater than s + 1, verifies that 

s-1 
fc=0 

From this and (5.4) we get 

D(ual) = uB 

* + j3I 
with B = , that is, u is semi-classical. □ 

2 ' 

Remarks 

(i) The distributional equation D{ua) = u(3 for a scalar functional u implies that 
the corresponding orthogonal polynomial sequence (p n ) satisfies 

(x k (ap^ + /3j/ n ), u) = ((x k p'J, ua) - k(x k - 1 p' n , ua) + (x k p' n , u/3) = 
= —k(x k ~ 1 p' n ,ua), 

which vanishes for /c = 0, . . . ,n — s — 1. So, 



<Vn + PPn = Yl X ^Pn+k- 



In the matricial case, the equality D(ual) = uB does not imply (x k P' n , uB) = 
(x BP^u) due to the non-commutativity of the matrix product, and, more 
generally, it is not possible to ensure the existence of matrix polynomials B n 
such that ( x x k P l n ,uB^ = (x k B n P' n ,u} neither. Therefore, we can not obtain 
in this way a differo- differential equation with the polynomial B explicitly. 

(ii) Obviuosly, the differo- differential equation given by Theorem 5.2 is not unique 
because we can modify it by adding any structure relation. 

(iii) The proof of Theorem 5.2 shows that, if u is a semi-classical functional with 
a Pearson-type equation D(ual) = uB, dega = p\, degB = qi, then 

j--s 
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where p = 2p\ — 2, and s = max{2pi — 2, 2s\ + 2} = {2pi — 2, 2q{\, according 
to the remark of Theorem 5.1. 



6 Some examples 

We will illustrate the preceding results with some examples. 

1. In the first one, we consider the scalar Laguerre weight w = xe~ x that verifies 
the Pearson-type equation 

{wx)' — w{2 — x). 

Now, we define the positive definite matrix weight function 

that also verifies a Pearson-type equation. In fact, 

2-x -2 



D(uxI) = uB(x), B[xj , Q A _ x 



Moreover 



, t , xe x \ 1 

"= TuT =i * 8 e-*)' T= -1 1 



i.e., u is congruent with a diagonal weight with Laguerre classical scalar weights 
in its diagonal. Let In be 1 
weight x a e~ x , a > — 1. Then, 



in its diagonal. Let In be the monic Laguerre polynomials asociated with the 



n(x) "V l£\x) 

constitutes a sequence of matrix orthogonal polynomials related to the diagonal 
weight u. Obviously u satisfies a Pearson-type equation, 

ft\-l TD/„\rrit (2 — x 



D(uxI)=uB(x), B(x) = (T t )-'B(x)T x Q ( _^ 

and a sequence of matrix orthogonal polynomials associated with u is given by 

P n (x) = L n (x)T, n > 0. 
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It is easy to obtain the structure relation 

xP: +1 (x) = (n + l)P n+1 + (n + l)(" + 2 n ° 4 )^(x), 

as well as, for all n > 2, a differo-differential equation 
x 2 P' 7 [{x) + xP' n {x) = n{n + l)P„(x)+ 



+ «(«-l)(" ( " +1) (B + 2) ° (n + 3))^W- 



Moreover, the polynomials /n satisfy the differential equation 

x4 a) "(x) + (x - a - l)4 a) '(x) = n4 a) (x), 
and so, we can obtain for the polynomials P n the following differential equation 

xP"(x)+( X Q x _ 4 JP^(x)=nP n (x). 

For the monic polynomials P n = T~ 1 P n we have 

xPZW + T- 1 ^- 2 x _^TP^x)=nP n (x), 

and then 

xP^(x) + B\x)P' n {x) = nP n (x). 

The polynomials P n can be obtained by means of a Rodrigues-type formula 
F„(*) = i(T<T)-Vi' f ^(T*T). 



2. Now, we consider the matrix weight 

1 



u := e x R(x), R(x, , _, 

1 1 x 

Notice that u is hermitian but not positive definite, u verifies the distributional 
equation 

u' = e- x2 R'(x) - 2xe~ x2 R(x) = u(-2xl + R- 1 (x)R'(x)) = uB(x), (6.1) 
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-2x 1 

where B(x) = ( ) . Moreover, u has no congruent diagonal weight. 



Let 



P n (x) = {-2)^e^S-\x) dn{e ~ X d J {x) \ n > 0, (6.2) 



where S(x) = [ ) . Derivating (6.2) we get 



P n +i(aO = ~\ {-2x1 + S- 1 (x)S'(x)) P n (x) - ^i*(x), 



P n+1 (x) = - 1 - (B t (x)P n (x) + P' n {x)) . (6.3) 

This shows that P n is a monic matrix polynomial of degree n because Pq(x) = I. 
Applying the Leibnitz rule for the derivatives in (6.2), we have 

„_ 2 -» c _i, , d^ l {-2xe- 2 S(x) + e-* 2 S'(x)) 



P n+2 (x) = (-2)- n ~V S- L (x) 



dx n+1 



{-2)---^S-\x) dn+1 ^^ x) K 

+ {n + 1){ _2)- n - 1 e x2 S- 1 (x) dn{e ~*J {x)) + 

- n -2^o-u^a^^o-u^ dn+1 (e- x2 S(x)) 



+ (-2)" n "V S- 1 (x)S'(x)S- 1 (x)- 



dx n+1 
So, we obtain the recurrence relation 

P n+2 (x) = -^ (B t (x)P n+1 (x) + (n + l)P n (x)) . (6.4) 

As a consequence, (P n ) is a sequence of left orthogonal matrix polynomials with 
respect to certain non-singular functional. Notice that (6.4) implies that the matrix 
coefficients E n = (x n P n ,u) = (P n ,P n ) u are given by 

E n = —r^nlE , 

so, the orthogonality matrix functional can not be positive definite. 

Moreover, from (6.3) and (6.4) we can obtain the structure relation 

P' n {x)=nP n - 1 {x), (6.5) 
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and, by derivation, the differo-differential equation 

P^(x) = n(n-l)P n - 2 (x). (6.6) 

So, (P n ) is a semi-classical matrix orthogonal polynomial sequence. Besides, from 
(6.3) and (6.5) it follows the differential equation 

P n '(x) + B t (x)P' n (x) = -2nP n (x). (6.7) 

Indeed, (P n ) is a sequence of left orthogonal matrix polynomials with respect 
to the matrix weight u. In fact, 



P' n {x)e- X R(x)P' k t {x)dx = 

(P"(x) + (-2xI + R 1 (x)R- 1 {x))Pl l {x))e- x2 R{x)Pl{x) dx = 

/OO 
P n (x)e~ x2 R(x)P t k (x) dx, 
-00 

the last equality obtained from equation (6.7). In the same way, we get 

/OO /»00 

P n (x)e~ x2 R(x)P' k t (x)dx = -2k / P n (x)e~ x2 R(x)P t k (x)dx. 
-OO J — OO 

So, we have the orthogonality of (P n ) with respect to u. 

Summarizing, u is a quasi-definite functional such that: 

(i) It satisfies the distributional equation Du = uB. 

(ii) The corresponding sequence of left orthogonal matrix polynomials (P n ) are 
given by the Rodrigues-type formula (6.2). 

(iii) (P n ) verifies the structure relation (6.5), the differo-differential equation (6.6) 
and the differential equation (6.7). Keeping in mind Remark (i) to Theorem 
5.2, notice that we have (6.7) because uB = (uB) 1 = B l u and B t P n = P n B l . 

3. In the last example, we consider the Jacobi scalar weight w = 1 — x 2 that 
satisfies the distributional equation 

(w(l - x 2 ))' = -w4x, are [-1,1]. 
We define the matrix weight function, 

u:=w(l 2 jO, xeI-1,1], 
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that verifies the Pearson-type equation 

D{u(l-x 2 )l)=uB(x), B(x) = (^l X ^fx)- 

Notice that u is positive definite and then, there exists a sequence of left orthogonal 
matrix polynomials (P n ) related to u. Moreover, u is not congruent with a diagonal 
weight. So, the sequence (P n ) satisfies a structure relation with s — 1, p — 2, and 
a differo-difierential equation of second order with the form 

2 
(1 - X 2 ) 2 P^(X) - X(l - 2 X )P' n {x) = J2 A nkPn+k. 

fc=-4 



Acknowledgements This research was supported by Direction General de Ensenanza 
Superior of Spain (DGES), Project PB98-1615. 

The authors are very grateful to Professor Francisco Marcellan for his remarks 
and useful suggestions. 



References 

[1] M. Alfaro, A. Branquinho, F. Marcellan, J. Petronilho, A generalization of 
a theorem of S. Bochner, Publicaciones del Seminario Matemdtico Garcia de 
Galdeano, Serie II, Seccion 1, mimero 11 (1992). 

[2] S. Bonan, D.S. Lubinski, P. Nevai, Orthogonal polynomials and their deriva- 
tives, SIAM J. Math. Anal. 18 (1987) 1163-1175. 

[3] T.S. Chihara, "An introduction to orthogonal polynomials", Gordon and Breach, 
New York, 1978. 

[4] A.J. Duran, On orthogonal polynomials with respect to a positive definite 
matrix of measures, Can. J. Math. 47 (1995) 88-112. 

[5] A.J. Duran, Matrix inner product having a matrix symmetric second order 
differential operator, Rocky Mountain J. Math. 27 (1997) 585-600. 

[6] A.J. Duran, W. Van Assche, Orthogonal matrix polynomials and higher-order 
recurrence relations, Linear Alg. Appl. 219 (1995) 261-280. 



334 M.CANTERO ET AL 



[7] J. Favard, Sur les polyndmes de Tchebicheff, C. R. Acad. Sci. Paris 200 
(1935) 2052-2053. 

[8] J.S. Geronimo, Scattering theory and matrix orthogonal polynomials on the 
real line, Circuits Systems Signal Process 1 (1982) 471-495. 

[9] E. Hendriksen, H. van Rossum, Semi-classical orthogonal polynomials, C.Brezinski 
et al. Eds., Lecture Notes in Math. 1171 (Springer, Berlin, 1985) 354-361. 

[10] F. Marcellan, H.O. Yakhlef, Recent trends on analytic properties of matrix 
orthonormal polynomials, Electr. Trans. Numer. Anal. 14 (2002) 127-141. 

[11] P. Maroni, Variations around classical orthogonal polynomials. Connected 
problems, J. Comput. Appl. Math. 48 (1-2) (1993) 133-155. 

[12] P. Maroni, Une theorie algebrique des polyndmes orthogonaux. Application 
aux polyndmes orthogonaux semiclassiques. C. Brezinski et al. Eds. Orthogo- 
nal Polynomials and Their Applications, IMA CS Ann. Comput. Appl. Math. 
9 (1991) 95-130. 

[13] G. Sansigre, Polinomios ortogonales matriciales y matrices bloques, Doctoral 
Dissertation, Universidad de Zaragoza (1992). In Spanish. 

[14] J. A. Shohat, A differential equation for orthogonal polynomials, Duke. Math. 
J. 5 (1939) 401-407. 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS,VOL.3,NO.3,335-345,2005,COPYRIGHT 2005 EUDOXUS PRESS.LLC 

An Euler-type quadrature rule 
derived by using Radon's method 

by Carlo Belingeri and Gabriella Bretti 

Rome University "La Sapienza" 



Abstract 

The use of Euler polynomials and Euler numbers allows us to con- 
struct a quadrature rule similar to the well known Euler-MacLaurin 
quadrature formula. 
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1 Introduction 

Among the composite quadrature rules the Euler-MacLaurin formula is of par- 
ticular importance since the extrapolation methods proposed by Romberg are 
essentially based on it. The Romberg formulas constitute one of the most effi- 
cient methods to estimate integrals (see [5]). 

It has been recently proved thet generalized quadrature procedures can be 
constructed employing Appell polynomials [4] and containing as a particular 
case either the Euler-MacLaurin formula or a similar formula using the Euler 
numbers (instead of the Bernoulli ones) and even derivatives (instead od odd) 
evaluated at the extrema of the considered integral. 

As it is well known, in [1] can be found a general theory of quadrature 
rules which is founded on a method going back to Radon, see [2] , reducing any 
quadrature rule known in literature to evaluate the integral of g(x)u(x) (g(x) 
weight function) to the construction of a suitable differential operator E, and 
of particular solutions of the adjoint equation E*((f>) = g. 

It is proved in [1] how the above mentioned operators can be chosed in order 
to recover the Gaussian quadrature rules and the Euler-MacLaurin formula too. 

In this paper we extend the Radon technique to the quadrature rule consid- 
ered in [3], whereas in [3] this formula is derived by using integration by parts. 
This new approach shows that the Radon method works even in this case. 
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2 Elementary quadrature rule in a finite inter- 
val 

In [1], chapter 2, an elementary quadrature formula which is valid for real- 
valued functions defined in a finite interval is constructed. The method can be 
summarized as follows. 

Let be g(x) G L[a, b] the weight, u(x) G AC" 1 ^ 1 ^, b] the argument function and 
denote by a = x < X\ < . . . < x m < x m+ i = b the nodes of the quadrature 
rule: 

„b n—l m 

/ g{x)u(x)dx = J2J2 A h^ h) (xi) + R(u) (2.1) 

Ja h=0 i=l 

associated to a linear differential operator E of order n 

n Jn—k 

E{u) = Y J a k {x)-— k u. (2.2) 

fc=0 ux 

We deal with the problem of finding the coefficients Au in equation (2.1) in 
order to satisfy the condition: 

E(u) = => R(u) = 0. (2.3) 

Let be 

r 

R = Vn.^, 

dx r ~ k 



E r = Y< a k( x )7Z^k> r = 0,l,...,n-l 

fc=0 



the reduced operators associated with the operator E, and let be 

n Jn—k 

E*(v) = E(-l) n ^^^K(^)^(^)] (aoW = 1)- (2-4) 

fc=o ax 

the adjoint operator of E. 

Then the following theorem is valid: 

Theorem 1. Denoting by E* the adjoint operators of the reduced operators 

E r , a necessary condition in order that (2.3) holds true is to assume: 

A hi = [E*-h-i(<Pi(x) - Vi-iix))]^ (2.5) 
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where tpi(x), . . . , tp m -i(x) are arbitrary solutions of the differential equation 

E*(<p)=g(x) (2.6) 

while ipo(x) and (p m (x) are integrals of the same equation (2.6) satisfying the 
initial conditions 

<p£\a) = 0, <p%\a) = 0, h = 0,l,...,n-l. (2.7) 

As a consequence the remainder R(u) can be written in the form: 

n Q III PCT, ' I 1 

R(u) = / (j){x)E(u)dx = Y, " ¥i{x)E{u)dx, (2.8) 

where <f>(x) = <fi(x), x G (xi,x i+ i), i = 0, 1, . . . ,m — 1, to. 

It is well known that if the function <f)(x) is continuous and does not change 
sign in [a, b], then the estimate of the remainder R(u) can be simplified (see 

[I])- 

3 The Euler-MacLaurin quadrature formula 

In [1], chapter 4, the Euler-MacLaurin formula is constructed using Bernoulli 
numbers. A synthesis of the method is as follows. 
Let us assume 

a = 0, b = 1, g(x) — 1, X\ — 0, X2 = 1, 

m = 2, n = 2^ + 2, £ = ^^, i/ = 0,l,... 

/ n B2u+2\X) — B2V+2 

^ {X) = (2, + 2)! ' 
where B2 V +2{x) and i32i,+2 are respectively Bernoulli polynomials and Bernoulli 

numbers. The coefficients in (2.1) are given by 

h even 

A hl . 

Bh + 1 h odd 
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h even 

fey h odd 



(h+i)i 
h = l,...,2v, 

Then (2.1) becomes 

j\(x)dx = i[«(0) + «(1)] + £ ^§y[u 2k -\0) ~ u 2k -\l)] + R(u) 

and (2.8), under the hypotesis u 2u+2 (x) G C[0, 1], becomes 

R(u) = f 1 B2 » + ? {x) - B y 2 » +2 u^){ x )dx = _^M„(^) (e) o < e < 1. 
v ; Jo (2i/ + 2)! v ; (2i/ + 2)! v ; 

4 Euler polynomials and Euler numbers: defi- 
nition and properties 

The Euler polynomials E n (x) are defined by the following generating function: 

n p xt oo ±n 

-TTl = E E n (x) - \t\ < vr; (4.1) 

the Euler numbers S n can be obtained by the generating function: 

E ^ (4-2) 



e* + e-* ^ n! 
and the connection between Euler numbers and Euler polynomials is given by 

En (l) =2 ~ n£n ' n = 0,1,2,... . (4.3) 

A recursive computation of the Euler polynomials can be obtained by using 
the following formula: 

E n (x) + £ UJ M*) = 2xn i n = 1, 2, . . . . (4.4) 
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The first Euler polynomials are given by 



1 3 1 

Eq(x) = 1, Ei(x) — x — -, E 2 (x) = x 2 — x, E 3 (x) = x 3 — -x 2 + - 



and the first Euler numbers are given by 

S = 1, £ 1 = 0, S 2 = -1, S 3 = 0, £ A = 5, £ 5 = 0, £ 6 = -61, £ 7 = 0, ... . 

For further values see [8], p. 810. 

The Euler polynomials satisfy the following relations: 

TV 

E "( x ) = T^ji E »-k(.x), fc = 0,l,...,n (4.5) 



E n (t)dt = E " +l( ^~f" +l(a) (4.6) 



*<*)=£ (*)*(*-*)"" (4 - 7) 

For x = and x = 1, by (4.7), can be obtained E n (0) and -E„(l), used in 
the following; the first values are given by 

E Q (0) = 1, E 1 (0) = - 1 -, E 2 (0) = 0, E 3 (0) = ±, E 4 (0) = 0, E 5 (0) = -±,... 

E (l) = l, £i(l) = ^, E 2 (1) = 0, E 3 (l) = ~, E 4 (l) = 0, E 5 (l) = ±,... 



5 The Euler-type quadrature formula 

In this section, starting from the Euler polynomials, we construct a quadrature 
rule similar to the well known Euler-MacLaurin quadrature formula, but using 
Euler (instead of Bernoulli) numbers in the remainder term, and even (instead 
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of odd) order derivatives of the given function evalueted at the extrema of 
the considered interval. In a forthcoming paper the same procedure will be 
generalized starting from the Appell polynomials. 
Let us assume 



a = 0, b — 1, g(x) — 1, x\ — 0, x 2 = 1, 



m = 2, n = 2z/ + 2, E 



d 



2u+2 



u = 0,l, 



dx 2 »+ 2 ' 

The differential equation E*(ip) = g(x) is to be written ip^ 2u+2 "> = 1. We need 
not to consider the solutions (po(x) , ip 2 (x) and it is sufficient to fix arbitrarily 
the solution tpi(x); for instance we may assume 

E 2v+2 (x) 



<pi(x) 



(2z/ + 2 

where E 2u+2 (x) denote the Euler polynomial of index 2v + 2. 

Equations (2.1) and (2.8) become, under the hypotesis u(x) G AC 2 ^ +1 [0,1] 

2i/+l 



(5-1) 



f 1 u{x)dx = £ [A hl u (h \0) + A h2 u {h \l)} + R{u) 
D / A f 1 E 2u+2 (x) < 2u+2 ), x , 



(5.2) 



(5.3) 



Now we have only to compute the coefficients A hl , A h2 using (2.5). We find 



A h i = [Et +1 _ h Mi =0 = (-ir +i - h ^ +i - h \o) 



A h2 = [E. 



„(2v+l-h) ,„, 
( -\\h+1^2v+2 W 
V L > (2u+2)\ 




h = 0, l,...,2z/ 


E 2 v+2(0) 
\ {2u+2)\ 




h = 2v + l 


E *2v+l-h(-¥l)] x= i = ' 


"(- 


-l) 2v+1 - h ( p? ,+1 - h \l 


„(2v+l-h), ■ 
( l\/l+l^+2 V 1 ) 
V L > {2u+2)\ 




h = 0,l,...,2u 


E 2 v+2(l) 
{2u+2)\ 




h = 2u + l 
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and therefore, recalling (4.5) and (4.7): 



.1.1 



h odd 
A hl . 

E h+1 (0) i 

1M3)I h eVen 

h odd 

+ v ; n even 



(h+i)i 



h = l,2,...,2v, 



A.2v+l,l — 0, A2u+1,2 — 0. 

Equation (5.2), recalling the property 

£ 2s +i(l) = -#25+i(0), s = l,2,..., 
becomes 

jT 1 «(x)dx = i[«(0) + «(1)] - £ ^M[« 2 *(0) + « 2fc (l)] + i2(«) (5.4) 
and (5.3), recalling (4.6), yields 

R(u) = u^ +2 \0 ^ p^4dx = ^+3(l)-^ +3 (0) (2 , +2)(0 = 
v ; ^7o (2fc + 2)! (2i/ + 3)(2z/ + 2)! y ^ J 

under the hypotesis -u^ +2 )(:r) G C[0, 1]. 

Then we have proved the following theorem: 

Theorem 2. Let tz(a;) G AC 2l/+1 [0, 1] then the quadrature rule 

j\{x)dx = 1[«(0) +«(1)] - £ |f^;K fc (0) +« 2fc (l)] + i2(«) 
holds true, where the remainder R(u) is expressed by 

D / n f 1 E2 U+ 2{X) (2u+2)/ \j 

R(u) = j o j^fit + ) (*)^ 
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6 Numerical example 



sin(i) 



Let us apply the quadrature formula (5.4) assuming u(x) = sm(x> ; y — 5. Then 



we found 



and consequently 

J = |(1.8414709) - -^-(-0.572466) + -^—(0.333073) - -^—(-0.232804) + 

31 691 
+ (0.277777) - (-0.090909) + R(u) = 0.946086418 + R(u), 

5461 

where #(«) = [-2 I 3^ r ]u (12) (0 = 0.00000087 • w( 12 )(£), < £ < 1. 
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1 Introduction 

A positive sequence (b n ) is said to be almost increasing if there exists a positive increasing 
sequence (c„) and two positive constants A and B such that Ac n < b n < Bc n (see [1]). 
We denote by BVo the BV n Co , where Co and BV of the null sequences and sequences 
with bounded variation, respectively. Let J2 a n be a given infinite series with partial sums 
(s n ). Let (p n ) be a sequence of positive numbers such that 



Pn = ^2Pv->°o as n^oo, (P_j = p_j = 0,i > 1). (I) 

v=0 

The sequence-to-sequence transformation 

I n 
U n = -^-^PvSv (2) 
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defines the sequence (U n ) of the (N,p n ) mean of the sequence (s n ), generated by the 

sequence of coefficients (p n ) (see [4]). 

The series J2 a n is said to be summable | N,p n \ k , k > 1, if (see [2]) 

oo 
E( P n/Pn) fe_1 | AC/ n _! | fe < 00 (3) 



n=l 

where 



Pn 



AU n -! = - p ^P,-ia„ n>l. (4) 

In the special case when p n = 1 for all values of n, | N,p n L sunimability is the same 
as | C, 1 | fc summability. Also if we take p n = ^-pthen | N,p n \ k summability reduces to 

I N -±- I 

I JV ' 71+1 Ifc- 

Quite recently Bor [3] has proved the following theorem. 

Theorem A. Let (X n ) be an almost increasing sequence and let there be sequences {(3 n ) 

and (A n ) such that 

I AA n |< (3 n (5) 

Pn — ► as n — > oo (6) 

oo 

J2 n | A/3 n | X n < oo (7) 

n=l 

| A n | X n = O(l) as n — ► oo. (8) 

If 

J2 l -^Ll = 0(l) as m^oo, (9) 

71=1 

m -I 

\" - I t n \ k = OiX m ) as m -> oo, (10) 

n=l 

and (p n ) is a sequence such that 



E jr I f « l*= °( x ™) as m - °°> ( n ) 

71=1 " 



where 



1 n 



</< — r^E ua *" ( 12 ) 



n + ^ , 
^=1 



then the series ^2 a nA n is summable | N,p n L for fc > 1. 
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2. The main result. The aim of this paper is to prove Theorem A under weaker 
conditions. For this we need the concept of quasi /3-power increasing sequence. A positive 
sequence (j n ) is said to be quasi /3-power increasing sequence if there exists a constant 
K = K{j3, 7) > 1 such that 

Kn^n > mP lm (13) 

holds for all n > m > 1. It should be noted that every almost increasing sequence is quasi 

/3-power increasing sequence for any nonnegative (3, but the convers need not be true as 

can be seen by taking the example, say 7„ = n~^ for j3 > 0. So we are weakening the 

hypotheses of the theorem replacing an almost increasing sequence by a quasi /3-power 

increasing sequence. 

Now, we shall prove the following theorem: 

Theorem. Let (A n ) € BVo and let (X n ) be a quasi /3-power increasing sequence for some 

< (3 < 1. If all the conditions of Theorem A are satisfied, then the series J2a n X n is 

summable | N,p n \ k for k > 1. 

Remark. If we take (X n ) as an almost incrasing sequence, then we get Theorem A. In 

this case the condition (A n ) £ BVo is not needed. 

We need the following lemma for the proof of our theorem. 

Lemma ([5]). Under the conditions on (X n ), (/?„) and (A n ) as taken in the statement of 

the theorem, the following conditions hold, when (7) is satisfied: 

nf3 n X n = O(l) as n — > 00, (14) 

00 

J2 PnX n < OO. (15) 

n=\ 

3. Proof of the Theorem. Let (T n ) denotes the (N,p n ) mean of the series ^a n \ n . 
Then, by definition and changing the order of summation, we have 

1 n v 1 n 

T n = — 2_^Pv 2^«iAj = — 2_^{P n — Pv-l)d v Xv 

Then, for n > 1, we have 

_ « r, n P \ 

rri rji Pn V - ^ D \ Pn V^ r V — \^V 

ln-ln-i - —— 2_^f v -ia v \ v - —— 2^ " Va ""- 
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By Abel's transformation, we have 

i 1 n—1 . -i n— 1 i -i 

T -T - U + l + \ _ Pn V" i \ V + 1 Pn V^ P A\ + U + 1 

-*n -*ra— 1 — p Pn^n^n p p / Pv^v^v id id / y ^v'-^-^v^v 

Tl-Tn r n r n —\ v —-± V -< n-* n— 1 ^ ^ 

n—1 -. 

+ ™ E ^wAv+l- 

= T nyl + T ny2 + T„ i3 + T nA , say. 
Since 

Tl i rri | ryi , /7-1 \k ^ A k ( I 'p |fc 1 I rfi \k 1 I 'p I k 1 I 'p I k \ 

n,l + + n,2 + -tra,3 + J n,4 | S 4 (J i n ,l \ + \ l n ,2 \ + | J-n,3 \ + I J n,4 | ), 

to complete the proof of the theorem, it is enough to show that 

oo 

Y.( P n/Pn) k - 1 \T n , r \ k < oo for r = 1,2,3,4. (16) 

n=l 

Firstly, we have that 

m m 

£(P«/Pn)* _1 I ^.l |* = 0{l)Y,T^\K\ k - 1 \K\\t n \ k 



i P ' 
n=l n=l 



o(i) £ I A « I ^ I * 



n=l 
m—l 



P„ 



Pv 



0(l)£A|A n |£f I* 



n=l u=l " " 

m 

0(1) I A m | E lr I tn \ k 
n=l ^ n 
m—l 



0(1) E I AA n | X n + 0(1) | \ m | X m 

n=l 

m—l 

0(1) E PnX n + 0(1) \\ m \X m = O(l) as m -► oo, 



n=l 

by (5), (8), (11) and (15). Now, when k > 1 applying Holder's inequality with indices k 
and k', where -r + p = 1, as in T ni i, we have that 

m+1 m+1 n—1 

E (^Pn/Pn)^ 1 | T„, 2 |* = OWETr^iE^l^lVjn 
n=2 n=2 r n^n-l v=1 
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1 n— 1 

^ n ~ 1 v=l 

m m+1 

= o(i)'£p v \x v \ k - 1 \\ v \\t v \ k £ /; " 



v=l 

m 



^+1 P nPn-l 



= 0(l)J2\^v\^-\t v \ k =0(l) as m^oo. 

v=l v 

Again, we have that 

m-i-l m+1 Ji— 1 

E^/Pn)"" 1 !^^ = O(l) £ -^{£ | AA, | P„ | tj fe } 
n=2 n=2 ^n^n-1 v=1 

1 n— 1 

x {p— E p «i A ^i}* _1 

^ n ~ 1 v=l 

m m+1 

= o(i)Y,PvP v \t v \ k y. "" 



v=-\ n=v+l P n P n-l 

m m— 1 -i 

0(l)£/3„|t w | fc =0(l)5>A,-|i«| fc 

U=l U=l 

m— 1 



m — L u -i lit -i 

0(1) 2 A(v(3 v ) J2 ~ I *i I fc +0(l)m/? m £ - | t v \ k 

V = l 1=1 V=l 

m— 1 

0(1) J2 | A(vA,) I X v + 0(l)m/3 m X m 

m— 1 

o(i) J2 I (« + 1 ) A A, - A I ^ + 0(l)m/? m X m 



t)=l 

m— 1 m— 1 

= O(l) J2 vX v I A A I +0(1) 2 I f3 v | X, + 0(l)m^ m X ri 

11=1 v=l 

= O(l) as m — > oo, 

by (5), (7), (10), (14) and (15). 
Finally, we have that 

m m+1 n—1 -, 

YtiPn/Pn)*- 1 I T nA \ k = O(l) £ — ^— 2 ^ I A.+1 || t„ | fc - 
n=l n=2 ^n-^-l „=l W 



x (p— E p « 



n— 1 
m -I m-t-± 

0(1)£P„|A„ +1 ||M^ 2 -£_ 

u=l V n=«+l ^""^-1 



1 ™ _1 I \ I 

m -i m+1 
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0(l)^|A„+i 



t \ k 



v=l 

m—l v ~\ rn 1 

o(i) X! A I Vn I E - 1 l r \ k +°( 1 ) I A ™+i I E - 1 f - 

m—l 



— 0(1) 2_^ I AAu + i | X v+ i + O(l) | A m+ i | X m+ i 

v=l 

m—l 

= O(l) E A+i^+i + 0(1) | A m+ i I X m+ i 

v=l 

= 0(1) as m —► oo, 
by (5), (8), (9), (10) and (15). Therefore, we get that 

m 

£0Pn/p») fc_1 I r„, r | fc = O(l) os m^oo, /or r = 1,2,3,4. 

n=\ 

This completes the proof of the theorem. 

If we take p n = 1 for all values of n in this theorem, then we get a new result concerning 

the | C, 1 \k summability factors. 

Also, if we take p n = —r^ in this theorem, then we get another new result concerning the 

| N, ^rf \k summability factors. 
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assume respectively either Gaussian or stable non-Gaussian unconditional dis- 
tributed index returns. Then, we approximate discrete time optimal allocations 
assuming returns following an ARMA process. Finally, we describe further au- 
toregressive portfolio choice models. 
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1. Introduction 

Over the last fifty years, the problem of optimal portfolio selection has lost none 
of its allure or importance for the financial community. Just consider the fol- 
lowing reason, which will appeal immediately to every investor. At a time when 
more than fifty percent of all financial assets in North America are controlled 
by pension or mutual funds, a lot of people apparently employ someone else 
to manage their money. Also, they are obviously willing to pay high fees or 
expenses for these services and are naturally very interested in how well "their" 
funds are performing. 

The mean-variance analysis, developed by Markowitz and Tobin, general- 
ized into an equilibrium theory by Sharpe, Lintner and Mossin and into an 
inter -temporal theory by Samuelson and Merton, was the first theory to give 
rigorous results to the portfolio selection problem in terms of the mean and the 
variance. However, many criticisms and empirical rejections have underlined 
the intrinsic limits of the mean-variance approximation. Probably Roll in [23], 
[24] , [25] was the first to clearly understand the weaknesses of the theory and 
the empirical deficiencies. On the other hand, the fundamental work of Mandel- 
brot [15], [16] and Fama [10] has sparked considerable interest in studying the 
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empirical distribution of financial assets. The excess kurtosis, found in Man- 
delbrot's and Fama's investigations, led them to reject the normal assumption 
(generally used to justify the mean-variance approach) and to propose the sta- 
ble Paretian distribution as a statistical model for asset returns. The Fama and 
Mandelbrot's conjecture was supported by numerous empirical investigations 
in the subsequent years (see, among others, Mittnik and Rachev [21], Rachev, 
Ortobelli and Schwartz [22]). This paper presents and discusses conditional and 
unconditional portfolio selection models for returns with heavy tails. 

Firstly, we examine the case of sub Gaussian a-stable distributed returns. 
This assumption permits a mean risk analysis pretty similar to the Markowitz- 
Tobin mean variance one. As a matter of fact, this model admits the same 
analytical form for the efficient frontier, but the parameters in the two models 
have a different meaning. Therefore, the most important difference is given by 
the way of estimating the parameters. In order to compare the performance of 
Gaussian and stable models we analyze an investment allocation problem. It 
consists of the maximization of the mean minus a measure of portfolio risk. The 
comparison made between the stable sub-Gaussian and the normal approach 
in terms of the allocation problem has indicated that the stable sub-Gaussian 
allocation is more risk preserving than the normal one and can give more oppor- 
tunities of earning. Precisely, the stable approach, differently from the normal 
one, considers the component of risk due to the fat tails. 

Secondly, we examine dynamic portfolio choices when returns follow an 
ARMA(1,1) model. Thus, in the multistage portfolio allocation problem we 
analyze the investor's choices considering an ARMA(1,1) model for the future 
scenarios of portfolio returns. Then, we compare investor's optimal allocations 
obtained when the residuals are either a-stable distributed or Gaussian distrib- 
uted. Thus, in order to value the impact of these distributional assumptions, 
we propose and examine an investment allocation problem. 

In Section 2 we introduce portfolio theory when returns are unconditionally 
stable distributed. In Section 3 we compare the stable sub-Gaussian multivari- 
ate approach with the normal multivariate one. Section 4 proposes a comparison 
among multi-stage conditional portfolio choice models. Finally, we briefly sum- 
marize the results. 

2. The sub-Gaussian ^-stable model 

In this section, we analyze the problem of optimal allocation among n + 1 
assets: n of those assets are stable distributed risky assets with returns z = 
[zi,...,z n ]', and the (n + l)th asset is risk-free with return Zq. 

Assume the vector of risky returns z = [zi,...,z n ]' is sub-Gaussian a-stable 
distributed with 1 < a < 2. Then, the characteristic function of z has the 
following form 

$ z (£) = E(exp(it'z)) = exp (- (t'Qt)^ +it'fi\ , (1) 
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r A ' 
where Q = \ofA = -f is a positive definite (n X n)— matrix, fi = E(z) is the 



mean vector. The term qf- is defined by 



a 2 

% r~ ~i ii.~ii2-c 



Zi , Z A 



2 l-" "JJ Q ll^lla > (.4) 

where 5j = £j — /ij is the centred return, the covariation [zi,Zj] between two 
jointly symmetric stable random variables Z{ and Zj is given by 

[zi,Zj] a = Si kjT" 1 sgn(sj)"j(ds), 

in particular, ||^-|| = ( fc. | s j| a 7(^ s )) = (Wj> Zj] a ) a • Here "f(ds) is the 
spectral measure and it has support on the unit circle S'2. 

This model can be considered as a special case of Owen-Rabinovitch's ellipti- 
cal model (see Owen and Rabinovitch [19]). However, no estimate procedure of 
the model parameters is given in the elliptical models with infinite variance. In 
our approach we use (1) and (2) to provide a statistical estimator of the stable 
efficient frontier. To estimate the efficient frontier for returns given by (1), we 
need to consider an estimator for the mean vector ji and an estimator for the 
dispersion matrix Q. The estimator of ji is given by the vector fi of sample 
averages. Using lemma 2.7.16 in Samorodnitsky, Taqqu [27] we can write for 
every p£ (1, a) 

i^L_ E {^ {p - 1] ) (3) 



1 1 a 



E(\z 3 \ p ) 



where Zj = sgn(zj) \zj\ p , and the scale parameter can be written (7jj 

= \\zj\\ . Then, Ojj can be approximated by the moment method suggested 
by Samorodnitsky, Taqqu in [27] Property 1.2.17 in the case (3 = for every 
p G (0, a) 



a. 



P Jo °° u P 1 sm w ^ u 

33 ~ N^lla ~~ 2P~ l T (l - E ) 



p — lie-. IIP 



E(\z 3 \ p ). (4) 



Moreover the following lemma holds. 
Lemma 1 

For any sub-Gaussian a-stable distributed vector z = [F 1; ...,z n ]' with null mean 

and 1 < a < 2, it follows that a p - = 2PV /\Sz) v tl±±\ E (l^i D f or an V P e I ' a ) 

and ?f = a 2 -j p 2pV ,\Sz\ v (1±l\ E {z i z j {p ' 1) j for any p G [l,a). In particu- 
lar, when the vector z = [z 1 ,...,z n ]' is multivariate normal distributed with 
null mean and variance co variance matrix V = [vfA, it follows that v P - 

-E(\zA p ) for any p > and vf- = v 2 T p »„_/« E (z i z i {p ~ 1) ) for 



zA p ) for any » > and vf, = v 2 T p 3p _A E (ziZ^ 1 " 1 ) fo 

J JJ 2 2— r(zi±\ V / 



any p > 1. 
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Proof Suppose we have a sequence of random variables 

X n £L p (n) = lx\ J \X\ p dfi< ool 

that converges in distribution to a random variable X £ L p (jj,), then the mo- 
ments curves g n (q) = E(\X n \ q ) (that are analytic functions), as n — > oo, con- 
verge uniformly to g(q) = E(\X\ q ) in the interval [0,p]. Consider a sequence 
X n = S an (l, 0,0), where a n £ (p, 2) and a n /* 2, then the sequence X n con- 
verges in distribution to a Gaussian random variable X with null mean and 
variance equal to 2. Thus, 

lim £(|XJ P ) = lim r+oa A, !»i , = 

n->oo vl ' ' 7l->OOPJo « p sm' iidti 

= g'-^Ci-f) = M|xn = 2^(41) 

p/ +o ° u-p- 1 sin 2 udu ^ ' ' v^ 

and J„ m _p_1 sin wdw = ^ wn±± X~- Because 

4_ a2 E {^ p - 1] ) _ j2 - P r(i-§)A / (p _ x) x 
2 «' £(|^f) " " 2ir (i - ^) r (£±i) v J / 



can be 



and for any i,j = l,...,n the elements of dispersion matrix Q 

definded 

qf. a 

-y = (-4(a,p)) p f{p,Zi,Zj) for every p G [l,a) (5) 

where A(a,p) = ^^^ , and /(p, 2^) = £ (^ (^) <P_1> ) Wif))^ 
In addition, as a / 2, then Q — > -j, where l 7 is the variance-covariance matrix 
and considering that f(p) = f(p, Zi,Zj) is an analytic function the thesis holds. □ 
The above suggests the following estimator Q = -£- for the entries of the 
unknown covariation matrix Q 



%_ d 2- v _ r(l-§)V5F 1 ^„ (kW „ (k ^(p-i) 



*^>^f-\ ( „ 



V a J \ 2 / fc=l 

where the Oj is estimated as follows 



~ 2 j f r(i-|)^ ifi (t) 
n 2 ^r(i-S)r(^)^^l J 



(7) 



It is important to observe that "the best" p depends on a and on the number 
of observations we have. The rate of convergence of the empirical matrix Q = 

to the unknown matrix Q (to be estimated), will be faster for a large 



2 



PORTFOLIO CHOICE WITH HEAVY TAILED DISTRIBUTIONS 357 



sample, if p is as small as possible, (see Rachev [20], Lamantia, Ortobelli and 
Rachev [13]). 

Under these assumptions, the wealth W = x'z + (1 — x'e)zo associated to 
the portfolio x is given by 

W = x'z+ (1 - x'e)zo = S a (a w ,Pw, E{W)) 
and W = Zq when x = 0, 



where a is the index of stability, Oyy = o x 'z = \/x'Qx is the scale (dispersion) 
parameter , fiw = ftx'z = is the skewness parameter and E(W) = x'E(z) + 
(1 — x'e)zQ. Recall that, when Ow x < Vw 2 , Pw-l = Pw 2 an d E{W\) = E(W2), 
then W\ second order stochastically dominates W<i and every risk averse investor 
prefers W\ to W2 (see Ortobelli [18]). Thus, when the returns z = [z±, ..., z n ]' are 
jointly sub-Gaussian a— stable distributed and unlimited short sales are allowed, 
every risk averse investor will choose an optimal portfolio among the portfolio 
solutions of the following optimization problem: 

min x'Qx subject to 

x j (^) 

x'fi + (1 — x'e)zo = mw 

for some given mean mw- Therefore, every optimal portfolio that maximizes a 
given concave utility function u, belongs to the mean-dispersion frontier 

if m> Zq 



_ } \/(^-ez )'Q 1 (fi-ez ) ,- Q -, 

^ £fliS if m < z ' l ' 

y'(fi-ezo)'Q 1 (fi-ez ) 

where fi = E(z); m = x'fi + (1 — x'e)zo; e = [1, ..., 1]' ; and o 1 = x'Qx. Besides, 
the optimal portfolio weights x satisfy the following relation: 

x=Q (n-zoe)- T77TT7 T- ( 10 ) 

Note that (9) and (10) have the same form as the mean-variance frontier. How- 
ever, even if Q is a symmetric matrix (it is positive definite), the estimator 
proposed in the sub-Gaussian case (see formulas (6) and (7)) generally is not 
symmetric. Therefore we could obtain an inconsistent situation in which x'z has 
stable distribution with negative squared scale parameter. However, in most of 

the cases x'Qx > for every vector x £ R n , because generally ^ — — 

positive definite matrix 1 . 



1 ,- (q+(.q)') 

Observe that for every x £ R n , we get x Qx > if and only if ^ i s a positive 

(q+{q)') 

definite matrix. Thus, we can verify that - 1 ^ ' 1S positive definite in order to avoid 

stable portfolios x'z with negative scale parameter estimators. Moreover, we observe that 

(q+(q)') 

the symmetric matrix - 1 ^ ' 1S an alternative estimator of the dispersion matrix Q whose 

statistical properties have to be proved. 
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Suppose z = [zi,...,z n ]' is ot.\ — stable sub-Gaussian distributed with mean 
x'fi and dispersion matrix Q ai . If we approximate the vector distribution with 
an a<2— stable sub-Gaussian law with 1 < a<i < a±, mean x' fi and dispersion 

matrix Q a2 , then, Q a2 fa I j7~~n ) Qai an d there are no consequences of this 
approximation error because the results of the portfolio selection problem (8) 
do not change. While if 2 > a<i > a± > 1, we cannot guarantee the same 
results of the portfolio choice problem. This first difference is one of the reasons 
for considering and studying the convergence properties of the estimator (see 
Rachev [20]) and the suitability of the model. Moreover, (10) exhibits the two 
fund separation property for both the stable and the normal case (see Ross 
[26]), but the matrix Q and the parameter a have different meanings. In the 
normal case, Q is the variance-covariance matrix and a is the standard deviation, 
while in the stable case Q is a dispersion matrix and a = ^/x'Qx is the scale 
(dispersion) parameter. According to the two- fund separation property of the 
sub-Gaussian a-stable approach, we can assume that the market portfolio is 
equal to the risky tangent portfolio under the equilibrium conditions (as in the 
classic mean- variance Capital Asset Pricing Model (CAPM)). Therefore, every 
optimal portfolio can be seen as the linear combination between the market 
portfolio 

x' Z = iv^-^f , (ii) 

and the riskless asset return Zq. Following the same arguments as in Sharpe, 
Lintner, Mossin's mean- variance equilibrium model, the return of asset i is given 
by: 

E( Zi ) = z + f3 hm (E(x'z) - z ) , (12) 

where /3j jm = =7q= , with e l = [0, ..., 0, 1, 0, ..., 0]' the vector with 1 in the i — th 
component and zero in all the other components. 

3. A COMPARISON BETWEEN THE NORMAL MULTIVARIATE DISTRIBUTIONAL 
ASSUMPTION AND THE STABLE SUB-GAUSSIAN ONE 

In this section we examine and compare the stable sub-Gaussian assumption 
with the normal distributional one. Thus, we implicitly assume that returns are 
uniquely determined by the mean m and a that is either the scale parameter of 
stable distributions or the standard deviation of normal distributions. 

In a recent work Rachev, Ortobelli and Schwartz [22] compare the stable 
non-Gaussian assumption and the normal one by analyzing optimal allocations 
between a riskless return and a benchmark index. Three different indexes have 
been taken into consideration: CAC40, DAX30 and S&P500. Next, we extend 
Rachev, Ortobelli and Schwartz's comparison to the multivariate case. This 
comparison is formally and theoretically different from the previous one because 
here the benchmark index is given by the market portfolio which generally will 
change, if the distributional assumptions change too. Thus, as a consequence of 
Roll [23], [24], [25], Dybvig and Ross' [8], [9] analysis, we observe that: 
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a) an investor, who fits the return distributions with a joint ai-stable sub- 

Gaussian distribution, will consider as inefficient the choice of another in- 
vestor who fits the return distributions with a joint a2-stable sub-Gaussian 
distribution with a\ ^ a<i\ and 

b) the stable CAPM is still subject to some of the criticism already addressed 

to the classical one. 

Nevertheless, it seems that the stable case better explains the empirical data. 
This is the main reason why we interpret and analyze the different behavior here 
between the investor who fits the data with joint stable sub-Gaussian distribu- 
tion and the investor who fits the data with the joint normal distribution. 

3.1 An optimal allocation problem 

First, we consider the optimal allocation among 24 assets: 23 of those assets 
are risky assets with returns z = [z±,Z2,..., £23]' and the 24th is riskfree with an 
annual rate of 6%. We analyze the portfolio choice problems when short sales 
are allowed and when short sales are not allowed. In view of this comparison, we 
discuss and study the differences in portfolio choice problems without examining 
them so as to choose one of the two assumptions (Gaussian or sub-Gaussian). 

In our comparison we use daily data taken from 23 international risky in- 
dexes valued in USD and quoted from January 1995 to January 1998. In the 
analysis proposed we first consider the maximum likelihood estimation of the 
stable parameters and of the Gaussian ones for every risky asset. Thus, Table I 
assembles the approximating parameters obtained from using Cognity System 
2 . In order to compare the different stable sub-Gaussian joint distributions and 
the joint normal distributions for the asset returns, we assume that the vector z 
is sub-Gaussian a-stable distributed, with a = a.^, k = I, 2, where ai = 1.7488 
represents the average of the indexes of stability and a<i = 1.8856 represents 
the maximum of the indexes of stability (see Table I) 3 . Moreover, when in the 
following tables we consider the index of stability a = 2, we implicitly assume 
that the returns are jointly normal distributed. Thus, every portfolio of risky 
assets is stable distributed in the following way: 

x z = •~>a k ( CT x'z, Px'z, m x'z), 

where a^ is one of the considered indexes of stability k = I, 2, o x t z = (x'Q^x) 2 

r 2 -1 

is the respective scale parameter, Qk = -¥- is the dispersion matrix, with 

L z Jfe 
k = 1,2, f3 x i z = is the skewness parameter, and m x i z represents the mean of 

x' z. Observe that the matrix Qk is estimated with the method defined in the 

previous section and thus it depends on the index of stability a.}, for k = I, 2. As 

observed previously, the rate of convergence of the empirical matrix Qk to the 



This software is developed by FinAnalytica Inc. 

We consider different indexes of stability, in order to value the effects of heavy - tailedness 
on the portfolio selection problems. 
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ASSETS 


Gaussian Parameters 




Stable Parameters 






Mean,u 


Standard 


Index of 


Stable 


Stable ! 


Stable scale 






Deviation a 


stability a : 


skewness ji 


Meanp | 


oarameter a 


DAX30 


0.0007 


0.0113 


1.8148 


-0.6682 


0.0005 


0.0069 


DAX100 


0.0007 


0.0106 


1.7996 


-0.6389 


0.0004 


0.0064 


CAC40 


0.0005 


0.011 


1.8381 


-0.1852 


0.0004 


0.0071 


FTSE all share 


0.0007 


0.007 


1.8418 


-0.5726 


0.0006 


0.0045 


FTSE100 


0.0008 


0.0078 


1.8856 


-0.5192 


0.0007 


0.0052 


FTSE actuaries 350 


0.0007 


0.0072 


1.8521 


-0.5666 


0.0006 


0.0047 


REUTERS Commodities 


-0.0002 


0.0072 


1.7959 


-0.2075 


-0.0003 


0.0045 


NIKKEI 225 Simple Average 


0.0005 


0.0157 


1.663 


-0.0483 


0.0004 


0.009 


NIKKEI 300 Weigh. Stock Av. 


0.0006 


0.0137 


1.6962 


0.0869 


0.0006 


0.0079 


NIKKEI 300 Simple Stock Av. 


0.0004 


0.0129 


1.7064 


0.085 


0.0004 


0.0075 


NIKKEI 500 


0.0003 


0.0128 


1.7253 


0.0334 


0.0003 


0.0076 


NIKKEI 225 Stock Average 


-0.0005 


0.0158 


1.6798 


-0.0721 


-0.0006 


0.0091 


NIKKEI 300 


-0.0005 


0.0138 


1.6994 


0.0303 


-0.0005 


0.008 


BRENT Crude 





0.0185 


1.7423 


-0.229 


-0.0003 


0.0112 


BRENT Current Month 





0.0186 


1.7405 


-0.2039 


-0.0001 


0.0112 


CORN n. 2 Yellow cents 


0.0002 


0.0152 


1.6869 


-0.1565 


0.0002 


0.0083 


COFFE BRAZILIAN 


0.0002 


0.0270 


1.5876 


-0.0153 


0.0007 


0.0144 


DOW JONES FUTURES 1 


-0.0001 


0.0055 


1.8063 


-0.4641 


-0.0002 


0.0035 


DOW JONES Commodities 


-0.0001 


0.0079 


1.6806 


-0.1389 


-0.0001 


0.0037 


DOW JONES INDUSTRIALS 


0.0013 


0.0086 


1.7368 


-0.2886 


0.0012 


0.0049 


FUEL OIL N. 2 


-0.0001 


0.0201 


1.7338 


-0.1961 


-0.0002 


0.0117 


GOLDMAN SACHS Comm. 





0.0092 


1.8036 


-0.2663 


-0.0002 


0.0058 


S&P500 


0.0009 


0.0083 


1.7052 


-0.0881 


0.0010 


0.0047 


Table I 















Maximum likelihood estimations of the Gaussian and Stable asset return parameters 
considering daily data from 1/3/95 to 1/30/98. 
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unknown matrix Qk will be faster for a large sample, if p is as small as possible. 
However, studying stable simulated data we obtained very good approximations 
even using p not too small (we refer to Lamantia, Ortobelli and Rachev [13] for 
further studies on this problem). In our estimations we use p\ = 1.6 (relative 
toai = 1.7488) and y>2 = 1-7 (relative to ot2 = 1.8856). 

We assume the investors wish to maximize the following utility functional: 

U(W) = E(W) - cE (\W - E(W)\ q ) , (13) 

where c and q are positive real numbers, W = Xzq + (1 — X)x'z is the return 
of the portfolio, Zq is the risk-free asset return, and x' z is the tangent portfolio 
return given by equation (11). With reference to the allocation problem (13), we 
observe that risk averse investors should choose a portfolio W = Xzq + (1 — X)x'z 
that maximizes the utility functional (13) for some real A and some q £ [l,a). 
We know that for A^ 1, portfolio return W = Xzq + (1 — X)x' z is distributed 
according to a stable law 

S ak (\l - X\ax> z ,0, Xz + (1 - X)m,x> z ); k = l,2 

and W = Zq when A = 1. Now, in order to solve the asset allocation problem 

maxE(W) - cE (\W - E(W)\ q ) , 

notice first that, for all q £ [1, a) and 1 < a < 2, we get 

U(W) =E(W)-cE(\W-E(W)\ q ) = 
= Xzo + (1 - A)m ?z -c(H (a, 0, q)) q \1 - X\ q o±, z 

where 

(fl(a ,o.,)).— L_- "^illffi 



A{a,q) r(l 



i\ 



2) 

(see the above lemma and Samorodnitsky and Taqqu [27]). The above relation 
analyzes the stable non-Gaussian case. When the vector z admits a joint normal 
distribution (i.e. a = 2), then for all q > 0, 

U(W) = E(W) - cE (\W - E(W)\ q ) = 
= Xz + (1 - A)mj- Z - c y ' \1-X\ q <7|, z 

Hence, the real optimal solution of the problem in the important case q £ (1, a), 
is given by 



A = 1 - sgn(l - A) ( ^^ - ) (14) 



sg n(l- A) (m ?z ~^ o)V ' 
<?co-|/ z ^(a,0,g) 

and x = (1 — X)x, (15) 

where x is given by (11) and 
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{(H (a, 0, q)) q in the stable case (l<a < 2) 
^1 in the normal case (a = 2) " 

Again, one would expect that the optimal allocation is different because the 
constant V (a, 0, q) and the matrix Q are different in the stable sub-Gaussian 
and in the normal case. 

3.2 Stable versus normal optimal allocation: a first comparison 

We analyze the differences in optimal allocations with reference to problem 
(13) when the investor chooses: 

1. joint normal distribution 



2. joint a-k stable sub-Gaussian distribution (k = 1,2) where a\ = 1.7488; 
a 2 = 1.8856 

as a model for the asset returns in his/her portfolio. Under these distinctive 
assumptions, the investors with utility functional (13) have different information 
about the distributional behavior of data. In particular, we examine the different 
market portfolio composition and the different investor's wealth allocation in the 
riskless asset. 

First, when short sales are allowed and when short sales are not allowed, 
we examine optimal allocation among the riskless return and 23 index-daily re- 
turns: DAX 30, DAX 100 Performance, CAC 40, FTSE all share, FTSE 100, 
FTSE actuaries 350, Reuters Commodities, Nikkei 225 Simple average, Nikkei 
300 weighted stock average, Nikkei 300 simple stock average, Nikkei 500, Nikkei 
225 stock average, Nikkei 300, Brent Crude Physical, Brent current month, Corn 
No2 Yellow cents, Coffee Brazilian, Dow Jones Futuresl, Dow Jones Commodi- 
ties, Dow Jones Industrials, Fuel Oil No2, Goldman Sachs Commodity, S&P 500. 
We use the riskless return 6% p. a.. Using the estimated daily index parameters, 
we can compute the dispersion matrices and the approximating "market" port- 
folios. The dispersion matrix Q is given by either the variance-covariance matrix 
(in the normal case) or the matrix Qk (in the stable cases) which depends on 
the index of stability a^ for k = 1, 2 {a.\ = 1.7488 and a<i = 1.8856). Therefore, 
as shown by Tables II, III, the market portfolio weights 

__ Q- 1 (ji-z e) 



e'Q 1 n — e'Q 1 ez 



change under the different distributional assumptions. We observe that the mar- 
ket portfolio composition does not change excessively when we use either the 

asymmetric estimator (6) and (7) of matrix Qk or the symmetric one 1 — —*-. 

However, using daily data the elements of the dispersion matrices are of order 
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Stable 



ASSETS 

DAX30 

D AX 100 

CAC40 

FTSE all share 

FTSE100 

FTSE actuaries 350 

REUTERS Commodities 

NIKKEI 225 Simple Average 

NIKKEI 300 Weigh. Stock Av. 

NIKKEI 300 Simple Stock Av. 

NIKKEI 500 

NIKKEI 225 Stock Average 

NIKKEI 300 

BRENT Crude 

BRENT Current Month 

CORN n. 2 Yellow cents 

COFFE BRAZILIAN 

DOW JONES FUTURES 1 

DOW JONES Commodities 

DOW JONES INDUSTRIALS 

FUEL OIL N. 2 

GOLDMAN SACHS Comm. 

S&P 500 



Gaussian 
Weights (a=2) 

1 .2784 
-1.1205 
-0.2022 
14.0729 
-1.6515 
-10.4307 
-1 .4679 
-1 .8360 

19.791 

-1 1 .2504 

15.5214 

0.6395 
-22.4705 

0.1653 

0.046 

-0.0306 

0.0201 
-0.4155 
-0.5327 

4.0654 
-0.2714 

0.2143 
-3.1345 



Weights for 

a=1 .8856 

0.7182 

-0.4594 

-0.1217 

10.669 

-2.5989 

-6.3873 

-1.3451 

0.0114 

19.4478 

-12.0352 

15.531 

-1 .2088 

-21 .3744 

0.185 

0.026 

0.0032 

0.017 

-0.2771 

-0.6422 

3.7832 

-0.274 

0.2356 

-2.9032 



Weights for 

a=1 .7488 

0.3929 

-0.0704 

-0.0694 

9.1802 

-3.0513 

-4.5968 

-1 .2833 

0.9302 

19.7594 

-12.7675 

15.8399 

-2.1322 

-21 .2774 

0.2125 

0.0044 

0.0295 

0.0175 

-0.2376 

-0.7123 

3.7278 

-0.2806 

0.2473 

-2.863 



Table II 



Stable sub-Gaussian and Gaussian market portfolio weights 
when short sales are allowed 





Gaussian 




Weights 


ASSETS 


(a=2) 


DAX100 


0.0732 


FTSE all share 


0.2496 


NIKKEI 300 Weigh. Stock Av. 


0.0485 


DOW JONES INDUSTRIALS 


0.6287 


OTHERS 






Stable 



Weights for 


Weights for 


a= 1.8856 


a= 1.7488 


0.0742 


0.0746 


0.2379 


0.2329 


0.0502 


0.0509 


0.6377 


0.6416 









Table III 



Stable sub-Gaussian and Gaussian market portfolio weights 
when no short sales are allowed 
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Gaussian 




Market 


PARAMETERS 


Portfolio 




(ot=2) 


MEAN 


0.0231 


STANDARD DEVIATION 


0.0418 


DISPERSION IF a=1 .8856 


0.0183 


DISPERSION IF a=1 .7488 


0.0101 



Stable 



Market 


Market 


Portfolio 


Portfolio 


oc=1.8856 


a=1.7488 


0.0236 


0.0243 


0.0433 


0.0451 


0.0185 


0.0191 


0.0101 


0.0104 



Table IV 



Stable sub-Gaussian and Gaussian market portfolio parameters 
for every mean-dispersion plane when short sales are allowed 



10~ 6 . Thus the approximation in the data used could be determinant for the 
elements of the matrices. In particular, Table II presents the market portfolio 
weights when we consider all 23 asset returns and short sales are allowed. Table 
III gives the market portfolio weights when no short sales are allowed. Under 
this constraint, we value the market portfolio weights in terms of the risky port- 
folio compositions which maximize the extended Sharpe ratio, i.e. the market 
portfolio weights are the solution of the following optimization problem 

E(x' z) — Zn 

max— — - — - 
x a *'z 

x'e = 1 
Xi > i = 1, ...,n 

In this case the optimal allocation is reduced only among the four risky 
assets: DAX 100 Performance, FTSE all Share, Nikkei 300 weighted stock av- 
erage, Dow Jones Industrials and the riskless one. As argued by Roll [23] , [24] , 
Dybvig and Ross [8], different market portfolios imply a completely different 
security market line analysis. Thus, the approach which takes into account 
short sales presents more opportunities of earning than the approach with no 
short sales constraint. Therefore, it dominates the other approaches. Besides, 
if the returns are jointly a^ stable sub-Gaussian distributed (for some deter- 
mined k = 1,2), then the Gaussian approach is inefficient. Since, in general, 
efficient and inefficient portfolios can plot above and below the "real" security 
market line. The analysis of Tables II and III points out that the composition 
of the market portfolio is strictly linked to the index of stability. In fact, we see 
that the allocation of the market portfolio in each asset component is generally 
monotone with respect to the stability index. Then the intuition suggests that 
the stable sub-Gaussian approaches take more into consideration the component 
of risk because of the heavy tails. Recall that the tail behavior of every stable 
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Gaussian 




Market 


PARAMETERS 


Portfolio 




(o=2) 


MEAN 


0.001072 


STANDARD DEVIATION 


0.006339 


DISPERSION IF a=1. 8856 


0.002783 


DISPERSION IF a=1. 7488 


0.001541 



Stable 



Market 


Market 


Portfolio 


Portfolio 


a=1.8856 


a= 1.7488 


0.001077 


0.001079 


0.006376 


0.006393 


0.0028 


0.002807 


0.001549 


0.001553 



Table V 



Stable sub-Gaussian and Gaussian market portfolio parameters 
for every mean-dispersion plane when no short sales are allowed 



non-Gaussian distribution X = S a (a, /?, /i), with 1 < a < 2, is given by 
lim X a P(±X > A) = cJ^-o a 

A— >+oo 2 



(16) 



where C n 



Therefore, the fat tails of smaller stability indexes 



r(2-a)co S ; 

underline the risk of the loss component of every portfolio. In particular, under 
the diverse distributional assumption, we distinguish the different perception of 
risk in the market portfolio components. This issue can be easily analyzed in the 
market portfolio weights with reference to the 23 returns when no short sales are 
allowed. In fact, Table I shows that the index of stability of FTSE all Share is 
greater than the other indexes of stability (of the assets DAX 100 Performance, 
Nikkei 300 weighted stock average, Dow Jones Industrials). Observe that in 
Table III the component of the FTSE all Share in the market portfolio increases 
with the index of stability a^ of the sub-Gaussian approach and the component 
of the other assets (DAX 100 Performance, Nikkei 300 weighted stock average, 
Dow Jones Industrials) decreases with the index of stability. Thus, the market 
portfolios obtained under Gaussian and sub-Gaussian distributional hypotheses 
consider the risks due to heavy tails differently. On the other hand the mean 
of market portfolios decreases with the index of stability. However, if we accept 
the idea that the market portfolios represent in some sense the market behav- 
ior, then according to the classic mean-risk interpretation, an optimal portfolio 
that has a greater mean, it has also a greater risk. This fact appears clear 
enough when we consider and compare the dispersion measures y/x' k QjXk in 

every mean-risk plane for every market portfolio weights Xj~ = J{ ~ /r °-i — ' 

for every k and j. Observe that Oj,k = \Z%'kQj%k is the dispersion measure 
of market portfolio x' k z considering the (Xj -stable Paretian approach. There- 
fore, for every fixed mean-risk plane (i.e. for every fixed (Xj stable distributional 
approach) we can compare the market portfolio risk positions considering their 
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Gaussian 




Market 


PARAMETERS 


Portfolio 




(a=2) 


! 1% WITH SHORT SALES 


0.0742 


! 5 % WITH SHORT SALES 


0.0457 


! 1% NO SHORT SALES 


0.01366 


! 5% NO SHORT SALES 


0.00934 



Stable 



Market 


Market 


Portfolio 


Portfolio 


a=1.8856 


a=1 .7488 


0.0456 


0.02451 


0.0212 


0.00241 


0.0094 


0.00621 


0.00569 


0.00291 



Table VI 



Stable sub-Gaussian and Gaussian market portfolio Values at Risk 



risk position Oj^ (varying k). According to a mean-risk interpretation, we could 
observe that the market portfolio with a greater mean admits also a greater dis- 
persion measure (Jj,k, in any mean-risk plane (see tables IV and V). However, we 
obtain different results if we use the Value at Risk, VaR 7 , as the risk measure 
of the market portfolios, which is implicitly defined by the following equality: 

VaRy(x'z) = sup {y : P(x'z < —y) > 7} . 



In fact, VaR measures the risk of loss which is represented by the left tail 
of the market portfolio distribution. Then as reported by Table VI we could 
observe that the VaRo.01 and VaRo.05 of the Gaussian market portfolio are 
greater than the analogous VaR 7 of the Sub-Gaussian market portfolios. In this 
sense the Gaussian market portfolio is riskier than the stable market portfolios 
because it does not consider the risk due to the heavy tails. Moreover, comparing 
the different VaR numbers, we can identify the market portfolio with index of 
stability ai = 1.7488 as the least risky. 

As a consequence of relation (16) it follows that every stable non-Gaussian 

distribution X = S a (o, /?, ji), with 1 < a < 2, has the property that 



E(\X-E(X)\ q ) <oo iorq<a 
and E(\X-E(X)\ q ) = 00 for q > a 



(17) 
(18) 



Hence, the weight of the risk measure E ( | X — E(X) \ q ) in optimization problem 
(13) is generally greater for the investors who use the stable laws for asset returns 
when q is quite close to the index of stability a. 

In Tables VII and VIII we list the optimal allocation A for the normal and 
the stable fit. Recall that A is the optimal proportion of funds invested in 
the risk free asset which maximizes E(W) — cE (\W — E(W)\ q ) , where W = 
Xzq + (1 — X)x'z. We have chosen q = 1.45 in Table VII and q = 1.55 in Table 
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Allocation A in the riskless asset considering the market portfolio on 23 assets 
when unlimited short sales are allowed 



A when 



Stable 

Optimal allocation A Optimal 
when a=1.8856 

-2.3803 
-1.1642 
-0.0863 

0.4857 

0.6509 

0.7672 

0.8347 

0.9252 

0.9696 

0.9856 



Coefficient "c" of Optimal allocation 

the optimization A when <x=2 

problem (Gaussian) 

C=1.8 -0.0715 

c=2.2 0.314 

c=3 0.6557 

c=4.2 0.837 

c=5 0.8893 

c=6 0.9262 

c=7 0.9476 

c=10 0.9763 

c=15 0.9904 

c=21 0.9954 

Allocation A in the riskless asset considering the market portfolio on 
when short sales are not allowed 

Coefficient "c" of Optimal allocation Stable 

the optimization A when a=2 Optimal allocation Optimal 

problem (Gaussian) 

c=1.4 0.3788 

c=1.5 0.4671 

c=1.8 0.6446 

c=2.2 0.7725 

c=3 0.8858 

c=4.2 0.9459 

c=5 0.9633 

c=6 0.9755 

c=7 0.9826 

c=10 0.9921 



allocation 
01=1.7488 

-10.2855 
-6.2252 
-2.6268 
-0.7171 
-0.1655 
0.2227 
0.4482 
0.7502 
0.8985 
0.9520 
23 assets 



A when «=1.8856 







0.3024 

0.6498 

0.8342 

0.8875 

0.925 

0.9467 

0.9759 



A when 



allocation 
«=1.7488 











0.4559 

0.6306 

0.7537 

0.8251 

0.9208 



Table VII 



This table computes the optimal allocation A in the riskless return 6% annual 
rate (daily z =0.00016667) for different risk aversion coefficient c of the 
optimization problem max^ E(W)-cE(\W-E(W)\' 45 ) where W=Xzcr+(^-k)x'r and x'r 
is either the Gaussian Market portfolio (for a=2) or the sub-Gaussian market 
portfolio (for a=1.7488 or a=1.8856). 



368 



S.ORTOBELLI ETAL 



Allocation X in the riskless asset considering the market portfolio on 23 assets 
when unlimited short sales are allowed 



Coefficient "c" of Optimal allocation 
the optimization X when a=2 
problem (Gaussian) 



Stable 
Optimal allocation X Optimal allocation 
when a=1.8856 X when a=1.7488 



c=1.5 




-1.2292 




-4.5346 -11.4475 


c=1.8 




-0.6003 




-2.973 -7.9354 


c=2.2 




-0.1111 




-1.7584 -5.2038 


c=3 




0.3678 




-0.5695 -2.5298 


c=4.2 




0.6571 




0.1487 -0.9146 


c=5 




0.7503 




0.3800 -0.3944 


c=6 




0.8207 




0.5549 -0.001 


c=7 




0.8646 




0.6637 0.2437 


c=10 




0.9292 




0.8242 0.6046 


c=15 




0.9661 




0.9159 0.8108 


Allocation X in l 


ihe riskless asset considering the market portfolio on 23 assets 




when short sales are not allowed 


Coefficient "c" of 


Optimal 


allocation 




Stable 


the optimization 


X when oc=2 


Optimal 


allocation Optimal allocation 


problem 


(Gaussian) 


X when a=1.8856 X when cx=1.7488 


c=1.3 












c=1.5 












c=1.8 




0.0859 







c=2.2 




0.3654 







c=3 




0.6389 




0.1239 


c=4.2 




0.8042 




0.5248 


c=5 




0.8574 




0.6539 0.2505 


c=6 




0.8976 




0.7515 0.4476 


c=7 




0.9226 




0.8123 0.5S26 


c=10 




0.9595 




0.9019 0.7S7S 


Table VIII 











This table computes the optimal allocation X in the riskless return 6% annual 
rate (daily z =0. 00016667) for different risk aversion coefficient c of the 
optimization problem maxxE(W)-cE(\W-E(W)\ 155 ) where W=Xzo+(l-X)x'r and x'r 
is either the Gaussian Market portfolio (for oc=2) or the sub-Gaussian market 
portfolio (for a= 1.7488 or a= 1.8856). 
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VIII, so that q is strictly less than all indexes of stability aj,, k = 1,2 in the 
data set. On the other hand, we want to value and compare the different effects 
of q being more distant or closer to the stability parameters a^. For any given 
allocation problem, we remark in bold character and in italics respectively the 
greatest and the smallest allocation in the riskless asset. Both tables show that 
the greatest diversity is among the optimal allocations corresponding to small 
risk aversion coefficient c. In contrast, the very risk averse investors would take 
a less risky position for every distributional hypothesis and hence the allocations 
in the riskless asset do not change very much. 

As we see from these tables, when q = 1.45 and q = 1.55 the investors who 
fit the data with the Gaussian approach generally assume a less risky position 
than the investors who fit the data with the sub-Gaussian approach. Thus, if the 
stable sub-Gaussian approximation presents greater performance opportunities 
than the Gaussian one (as observed by many empirical analyses) the "stable 
investors" have more opportunities of earning than the "Gaussian investors". In 
particular, the investors with a\ = 1.7488 stable sub-Gaussian approach invest 
less in the riskless asset than the investors who fit the data with the other 
approaches. However, if we consider q much closer to ai in the optimization 
problem (13), then as a consequence of (17) and (18), the investors who fit 
the data with a\ stable sub-Gaussian approach assume a less risky position 
than the investors who fit the data with the Gaussian approach. In this case, 
the "stable investor" has a very risk preserving behavior because he does not 
prefer allocating too much wealth in the risky asset. In this sense, intuition 
suggests that the stable approach with lower index of stability generally is more 
risk preserving than with higher index of stability because the component of risk 
due to the fat tails of asset returns is taken into account. Therefore, the stability 
index plays a strategic role in the stable optimal portfolio selection. Conversely, 
q in the above optimization problem can be regarded as an opportune measure 
of the magnitude to be given to the component of risk due to the heavy tails. 
The importance given to q is intuitively linked to the conditions of the market 
in which the investor operates. 

4. Portfolio selection with arma models 

The study and the analysis of the empirical behavior of data (see, among 
others, Akgiray [1], Campbell [5], Fama and French [11] French, Schwert and 
Stambaugh [12]) have reported evidence that the conditional first and second 
moments of stock returns are time varying and potentially persistent especially 
when returns are measured over long horizons. However, the evidence observed 
on temporal dependence for daily stock returns is not strong (see among others 
Fama and French [12]) and it is still a source of controversy whether stock 
market price models should include long memory or not (see among others Lo 
[14]). In this sense the unconditional models of previous sections represent valid 
and indicative portfolio choice models when returns are measured over short 
horizons. 
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If financial modeling involves information on past market movements, it is 
not the unconditional return distribution which is of interest but the conditional 
distribution which is conditioned on the information contained in past return 
data, or in a more general information set. The class of auto-regressive moving 
average (ARMA) models is a natural candidate for conditioning on the past of a 
return series. These models, especially when the series are stationary, are widely 
used to predict future movements of prices, costs and returns. The reason for 
this popularity lies, in their simplicity and because they give a good approxi- 
mation of a very wide class of stationary sequences. ARMA models have the 
property that the conditional distribution is homoskedastic (i.e. constant con- 
ditional volatility) . In particular, if we model asset returns with an ARMA(p, q) 
model of auto-regressive order p and a moving average order q, returns assume 
the following form: 



z t = a + ^2 a-i^t-i + St + ^ b i 
i=i j=i 



-j' 



where {e t } is a white noise process. To specify the orders p and q, we can follow 
the standard Box- Jenkins identification techniques (see among others, Box and 
Jenkins [3], Brokwell and Davis [4]) and we inspect the sample autocorrelation 
functions (SACFs) and sample partial autocorrelation functions (SPACFs) of 
the return series. In particular, we can assume that the sequence of innovations 
{e t } is an infinite variance process consisting of i.i.d. symmetric random vari- 
ables in the domain of normal attraction of a symmetric stable distribution with 
index of stability a £ (0, 2) and scale parameter Cq > 0. That is, 



N 



N~ 1/a J2 £ t^ Y as N 



where Y has characteristic function $y(m) = E(e luY ) = e~< a ° u < . Under these 
assumptions, Mikosh, Gadrich, Kluppelberg and Adler [17] determined esti- 
mators for this process based on the sample periodogramm of z t , and studied 
their asymptotic properties. Other estimators (a Gaussian-Newton type and an 
M-estimator) for the parameters of the ARMA process with infinite variance 
innovations were proposed and studied by Davis [7]. It is interesting to observe 
that in contrast to ARMA processes with finite variance, in the stable case we 
generally obtain an estimator the rate of convergence of which is considerably 
faster. 

Next we consider a portfolio choice ARMA(1,1) model and we want to com- 
pare the impact of stable residuals and Gaussian ones. So we propose a dynamic 
portfolio choice among three risky indexes Dow Jones Industrial, DAX 100 and 
FTSE all share. We consider 5000 portfolios of these assets and we assume 
that portfolio returns admit the form x' z t = cio + a\x' z t -\ + e t + &i£t-i where 
we suppose the sequence of innovations {e t } are either Stable or Gaussian dis- 
tributed. We consider 803 daily observations of index returns from 01/04/1995 
till 01/30/98 and for each portfolio we verify stationarity and we estimate the 
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Empirical CDF 



0.8 



0.6 



0.4 - 



0.2 - 




-0.03 




Empirical residuals 

— Generated residuals 
(normal scenario) 



-0.02 



-0.01 



0.01 



0.02 



0.03 



0.04 



Figure 1 



Kolmogorov-Smirnov test for Gaussian distributed residuals. 



parameters of the model. By first comparison between the Gaussian and the 
stable non-Gaussian hypothesis it appears clear that stable distributions ap- 
proximate much better the residuals than the Gaussian one. As a matter of 
fact, with the Kolmogorov-Smirnov test we can compare the empirical cumula- 
tive distribution function (cdf) Fe(x) of the residuals corresponding to several 
portfolios with either a simulated Guassian or a simulated Stable distribution 
fitted in advance. By this first analysis we can generally reject at 5% confidence 
level the hypothesis of normality because we obtain that the probability that 
the empirical distribution of the residuals is Gaussian, is on average (among 
different portfolios) equal to 1.2xl0~ 6 <C 5%. In addition we could observe that 
on average (among different portfolios) k = sup \Fe(x) — F(x)\ = 0.1875 where 

X 

F(x) is the fitted cdf of the Gaussian law. Both functions, Fe(x) and F(x), are 
shown on Figure 1. 

In contrast, generally we cannot reject at 5% confidence level the hypothesis 
that the residuals follow a stable law because the probability, that the empirical 
distribution of the residuals is stable, is on average (among different portfolios) 
equal to 20.22%. In addition we could observe that on average (among different 
portfolios) k = sup \Fe(x) — F(x)\ = 0.075 when F(x) is the cdf of a stable law. 

X 

Both functions, Fe(x) and F(x), are shown on Figure 2. 

In order to value the impact of different distributional approximations on 
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Empirical CDF 
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Empirical residuals 

— Generated residuals 
(stable scenario) 
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-0.04 



-0.03 



-0.02 



-0.01 



0.01 0.02 



0.03 



Figure 2 



Kolmogorov-Smirnov test for Stable distributed residuals. 



investor's portfolio choice we consider a dynamic asset allocation approach very 
similar to those proposed by Boender [2] and Rachev, Ortobelli and Schwartz 
[22]. So we generate about 2500 initial asset allocations. These allocations 
are then simulated into the future by using the economic scenarios, which are 
generated under the Gaussian and stable assumptions for the innovations of the 
time series models. Future economic scenarios are simulated at daily intervals. 
One set of scenario is generated by assuming that residuals of the variables 
are i.i.d. normal and another set of scenario is generated by assuming that 
residuals are i.i.d. stable. The horizon of interest is 10 days and two scenarios 
are generated for each day, so 1024 possible economic scenarios are considered 
for each initial portfolio. The 10-day scenario tree is repeated 10000 times. We 
assume that investors wish to maximize the following functional of final wealth: 

U(W T ) = E(W T ) - cE (\W T -£(T^ T )| 1,5 ) 

where c is a coefficient of investor's risk aversion. Therefore, the investor will 
choose among the initial portfolios the portfolio weight vector x = [xi,X2,X3]' 
(xi > 0, X\ + x<i + x% = 1) which maximizes the utility functional U(Wf) con- 
sidering that E(Wj,) = t; J2 S = 



with portfolio x; E (\Wf 
of risk associated to x ; 



l ^Tt ^ t ne mean of final wealth that we obtain 
E(W%)\) = | Y,s=i \ W s,T ~ E ( W t)\ is the measure 



Ws*T 



n 



i 



■ R lt) 



1 is the final wealth that 
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Stable innovations 




Risk 




Optimal portfolio composition 


aversion 


Expected 


Dow 




parameter 


Utility 


Jones DAX100 


FTSEall 


"c" 




Industrials 


share 


0.1 


0.00318 


0.01643 0.96489 


0.01866 


0.3 


-0.00286 


0.01643 0.96489 


0.01866 


0.5 


-0.006 


0.06585 0.33784 


0.5963 


0.7 


-0.00853 


0.06585 0.33784 


0.5963 


0.8 


-0.0097 


0.05988 0.31699 


0.62312 


2.2 


-0.0247 


0.0813 0.21418 
Gaussian innovations 


0.70451 


Risk 




Optimal portfolio composition 


Aversion I 


Expected 


Dow 




Parameter 


Utility 


Jones DAX100 


FTSEall 


"c" 




Industrials 


share 


0.1 


0.00106 


0.004192 0.520012 


0.475796 


0.3 


-0.00295 


0.04557 0.305768 


0.648662 


0.5 


-0.0061 


0.047021 0.268504 


0.684475 


0.7 


-0.00902 


0.05344 0.288484 


0.658076 


0.8 


-0.01013 


0.053643 0.239928 


0.706429 


2.2 


-0.02575 


0.053643 0.239928 


0.706429 


Table IX 









Maximum E(W 10 )-cE(\W 10 -E(W 10 )\) and portfolio 
composition considering return scenario generated with an 
ARMA{ 1,1) model under the Gaussian and stable 
assumptions for the innovations of the time series models. 



we obtain with portfolio x under scenario s £ {1, 2, ..., S{ ; R x s t = X\Zi ySyt + 
X2%2,s,t + XzZz,s,t is the return of portfolio x under scenario s £ {1,2, ..., S{ 
in time period t and z^ st is the rate of return of i — th asset under scenario 
s £ {1,2, ..., S{ in time period t. The calculations for this empirical analysis 
were performed either in Matlab environment or programmed in Delphi. Table 
IX summarizes the results of this comparison. In particular we observe that 
for each risk aversion coefficient we obtain greater expected utility using stable 
distributed residuals. Thus it is implicitly confirmed that we have presumed a 
better distributional approximation. Just as in the case of the previous empir- 
ical comparison we observe that there exist substantial differences between the 
portfolio allocations under the different distributional approaches. 

4. Conclusions 



In this paper we have shown that the classical portfolio choice models can be 
generalized assuming stable distributions for the underlying random variables 
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and that the generalized models not only are theoretically justifiable and empir- 
ically testable, but they generally have better performance than the respective 
Gaussian models when asset returns exhibit heavy tails. 

By comparing the joint normal distribution with the joint stable sub-Guassian 
one, it has occurred that the results received from the examined optimal allo- 
cation problems are substantially different. In particular, the stable market 
portfolio is generally less risky than the Gaussian market portfolio. This in- 
tuitive result is confirmed by comparison of the optimal allocations when the 
different distributional hypotheses are assumed. Therefore, the investors who 
fit the data with the stable distributions are generally more risk preserving than 
the investors who fit the data with the Gaussian law because stable laws take 
into account the component of risk due to heavy tails. Thus, we find that the 
tail behavior of sub-Gaussian and Gaussian approaches could imply substantial 
differences in the asset allocation. 

These results are empirically confirmed if we compare portfolio choices ob- 
tained when considering portfolio returns that follow an ARM A( 1,1) model with 
stable or Gaussian distributed residuals. As a matter of fact, we observed that 
the distribution of residuals is asymmetric and leptokurtic and the hypothesis 
of normality is usually rejected under statistical testing. In addition, we show 
that the approximation given by the stable residuals imply better performance 
for risk averse investors. 
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Abstract 

Weighted (or product) double integral inequalities are developed and extended to pro- 
duce weighted cubature rules. The error bounds are of first and second order and rely on 
the first few moments of the weight. Various properties of the weight and weight null- 
spaces are considered. Minimization of the bound produces coupled non-linear equations 
whose solution furnish optimal weighted cubature grids. These grids are evaluated for 
some of the more popular weight functions. 

Key words: Integral inequalities, cubature, singular integration, grid generation 



1. Introduction 

Milovanovic [3] (see also [4]), Barnett and Dragomir [1] and Hanna et al. [2] developed two dimensional 
integral inequalities whose error bounds were expressed in Lebesgue norms of the first partial derivatives of 
the integrand. In other work, Roumeliotis [7] developed and reviewed weighted one dimensional Ostrowski 
type inequalities with a particular emphasis of identifying optimal quadrature grids. These grids, influenced 
by the first few moments of the weight function, were evaluated via minimization of the Ostrowski type error 
bound. In this paper we combine and extend these results to develop weighted first and second order double 
integral inequalities. Particular attention is paid to the influence of the two dimensional weight function on the 
error bound and we explore this influence for different weights and weight null-spaces. Furthermore, weighted 
second order cubature rules are developed and we devise a method for calculating cubature grids that rely 
only on the first two moments of the weight. A method for calculating a priori cubature grids is given. 

The work in this paper is presented in the following order. In Section \2[ a two variable Taylor expansion 
is employed to develop weighted two dimensional integral inequalities. Milovanovic [3] used this method to 
extend Ostrowski's inequality to multiple dimensions. Here we will content ourselves with two dimensions, but 
extend the order of the rule to two. We undertake an examination of the error bound and identify parameters 
that will minimize the bound. In Section [3J we present a Peano kernel method, based on analogous results 
in [1], to derive a second order weighted double integral inequality. Error bounds are expressed in terms of 
the L\ and L^ norms of the first mixed partial derivative of the integrand. Particular attention is paid to 
minimizing this integrand for different weights and null-spaces. Finally, the results of this section are extended 
in Section \5\ to develop a weighted cubature formula. Minimizing the error bound furnishes a set of non- linear 
coupled equations in the first two moments of the weight whose solution produces a cubature grid influenced 
by the weight function. Plots of the grid for various weights are given. 

2. Taylor's Formula 

In 1975, Milovanovic [3] generalised the Ostrowski inequality to multiple dimensions using the multiple 
variable Taylor formula. As per the Ostrowski result, the inequality was expressed in terms of the first partial 
derivatives of the integrand. We state the two dimensional formula below. 

Following [3], let D = {(xi_,X2)\di < Xi < bi(i = 1, 2)} and let D be the closure of D. 



Theorem 1. Let f : M. 2 — ► R be a differentiable function defined on D and let 
in D. Then, for every X = (xi, x 2 ) G D. 

i-bi pb 2 






<Mi (Mi >0; i = 1,2) 



(1) 



1 



(h -ai)(&2 -a 2 ) 



f(ti,t 2 )dt 2 dti - f(xi,x 2 ) 

ai J a 2 

< Mxfr ax) f (*i-^) + 1 V M2{b2 a%) ( fa 



Q2+fc2 



(&i-ai) 2 4/ z "" z Z M (62-03) 
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The weighted version of Theorem [TJ appears below. 



Theorem 2. Let f : R 2 —* R be a differentiable function defined on D and let 4^ < M» (Mj > 0; i = 1,2) 

in D. Furthermore, let the function X i— > w(X) be defined, integrable and w(X) > for every X G D. Then 
for every X G D, 



(2) 



frj^wih^m^dhdh 



r hi f b 



/a 1 1 /o a a «'(*l»*a)*2Cftl 



/(»1,*2) 



1 



^ TbTT^ : ; ( Ml / ' [ 2 ™(ti,t 2 )\x 1 ~t 1 \dt 2 dt 1 

Joi Ja 2 W{t 1 ,t 2 )dt 2 dt 1 \ J ai Ja 2 

rbi pb 2 \ 

+ M 2 / / «;(ti,t2)|a;2-t2|dt2dti • 

Jai v/a 2 y 

Theorem [2] can be extended to higher orders and below we provide such an extension to second order. 

Theorem 3. Let f : [ai,bi] x [02,62] — > K 6e suc/i £/ia£ a/Z its partial derivatives up to order 2 exist and be 
continuous, i.e. —3 ■ * k < 00, i — 1, 2; j = 0, . . . , i; k — i — j. Furthermore, let w : (a 1; 61) x (a 2 , b 2 ) — ► (0, 00) 

be integrable (i.e. JJwdA < 00,). Then for all (xi,x 2 ) G [ai,&i] x [02,62] the following second order product 
double integral inequality holds 

b\ t>b 2 rb\ rb 2 

J w(t 1 ,t 2 )f(t 1 ,t 2 )dt 2 dt 1 - f{x 1 ,x 2 ) J J w(ti,t 2 ) dt 2 dti 






(xi,x 2 ) 



fci r b 2 



< 



I d 2 f II 

dt 2 HOO 



ai JO] 
61 /.6 2 



61 ,-b 



w(ti,t 2 )(xi -t\)dt 2 dti + —(xi,x 2 ) / / w(t 1 ,t 2 )(x 2 -t 2 )dt 2 dt 1 



w(ti,t 2 )(xi - ti) 2 di 2 d£i + 



I 9 2 / 1| 
at? N°° 



bi f62 



ai Ja2 



w(tl,i2)(X2 — t 2 ) dt 2 dti 



(3) 



a 2 / 



r 6i /.b 



/■Ol /"02 

/ / w{tl,t2)\x\-ti\\X2-t2\dt2dtl. 



dtidta 

Proof. The two- variable Taylor formula states that 

df df 

(4) f{ti,t 2 ) = f{xi,x 2 ) + (ti -xi) — (#1,0:2) + (*2 - x 2 ) — (x 1 ,x 2 ) 



(t X -X X ) 2 d 2 f d 2 f (t2-X 2 ) 2 d 2 f 

T2-(4l,6) + (^l -Xl){t 2 -X 2 ) (gl,&) + " 7S2-(§1,6), 



at 2 



'a*iflfe 



^ 



where & = i,; + 9(xi — ti), i — 1, 2, < 9 < 1. Multiplying Q by w and integrating produces the identity 

/•bi />b2 /*bi /*b2 

(5) / / w(t 1 ,t 2 )f(t 1 ,t 2 )dt 2 dt 1 -f(xi,x 2 ) / / w(t 1 ,t 2 )dt 2 dt 1 

J a\ J a 2 J a\ J a 2 

!J/ /■&! ("b2 

+ ^r( x i' x 2) / w(t 1 ,t 2 )(xi-t 1 )dt 2 dt 1 

dh J ai Ja 2 

df f bl f b ' 2 

+ ^TWi^) / / w(t 1 ,t 2 )(x 2 ~t 2 )dt 2 dti 

at 2 Jai Ja 2 

bi /-b 2 /j. __ \2 fl2f 

ai J02 ^ OT 1 

bl (-b 2 a2 J 

/ w{ti,t 2 ){ti ~xi)(t 2 -x 2 )— — (^1,^2) dt 2 dt x 

O! A 2 OT1OT2 

W{tl,t 2 ) 2 9+2 (?1>?2)) *2*1- 

Taking the modulus of both sides of ([5|, applying the triangle inequality and then Holder's inequality on the 
right hand side gives |3). D 
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Corollary 4. Let the conditions for f be as in Theorem^ Then the following double integral inequality holds 



(6) 



1 



(6i -ai)(&2 -a 2 ) J ai 



f(ti,t 2 )dt 2 dti -f(xi,x 2 ) + -^-(xi,x 2 ) [xi - 

0,2 



df ( oi + 6i 



< 



2 / 



+ 



2 / 



<%i<% 2 



(&i -ai)(6 2 -a 2 ) 



at 2 

(xx- 



(&i-ai) 2 



^v-x,— / ^— 2 

(6i-«i) 2 Ax!-^^) 2 
2 ^ (6i-ai) 2 

(* 2 -^) 2 
(62 - a 2 f 



01+61 ' 

2 , 



4 



12 

" 4 



+ 



a 2 / 
0*2 



(ft 2 -a2) 2 /(*2-°^ 
2 I (& 2 -a 2 ) 2 



12 



a 



Proof. Substituting ^(^1,^2) = 1 into ([3) and simplifying produces the desired result. 

The point (a:i,X2), the sample point of the integration rule, is free to be chosen. Often, such points are 
chosen to simplify the rule. For example, in {3} if we choose the weight mean 

ll 1 1 ll 2 2 ^w(ti,t 2 )dt 2 dt 1 

/ai/oa'wC*!.^)^*! 

then the partial derivative terms vanish. Fortuitously, in this case, this point also minimizes the bound. In 
the following sub-section, and indeed this paper, we will not be concerned with simplifying the integration 
rule, but instead attempt to determine such parameters (for eg. x\ and x 2 ) in order for the error bound to be 
minimized. 

2.1. Minimizing the upper bound. 

Corollary 5. The bound in equation (f||) is minimized at the median point (xi,2!2) satisfying 



(7) 

and 

(8) 



w(ti,t 2 ) dt 2 dti 



61 



w(ti,t 2 )dt 2 dti 



x 2 nbi rb 2 pbi 

/ w(ti,t 2 )dtidt 2 = / / w(ti,t 2 )dtidt 2 . 

a 2 J a\ J x 2 J a\ 



Proof. It is a simple matter to show that 

l>b\ rbi i>b\ (>b 2 

I(x 1 ,x 2 ) = M 1 / w(t 1 ,t 2 )\x 1 -ti\dt 2 dti+M 2 / w{h,t 2 )\x 2 -t 2 \dt 2 dti 

J a\ J a 2 J a\ J a 2 

is a convex function. Hence the upper bound in |2) is minimized at the stationary point of I. Evaluating the 
first partial derivatives of I produces equations (J7J and (|S) . D 

That is, the minimum point is the median of the weight in each direction. This is consistent with first order 
rules reported in [7]. 

Minimization of the second order bound in Theorem \3\ is not as simple. It is quite difficult to identify a 
minimum point for the upper bound of (3|. This bound is comprised of three components; the first and last 
are minimized at the mean (in each direction) 



(9) 



x\ = 



Ja I 1 /a 2 2 * 1 ' U; (* 1 '*2)^2d*l 



X 2 = 



Sal J a2 2 *2W(tl,t 2 )dt2d*l 



Sal Sal W ^ ' ^ dt2dtl Sal Sal W ^ ' * 2 ) dt2(itl 

while the second is minimized at the root of a median-type expression 



Xx rb 2 rbx /-& 2 

/ \x 2 -t 2 \w(t 1 ,t 2 )dt 2 dt 1 = / \x 2 -t 2 \w(t 1 ,t 2 )dt 2 dt 1 



fb 1 



(10) 
and 

[■X2 pbl r0 2 rDx 

(11) / / \x 1 -t 1 \w(t 1 ,t 2 )dt 1 dt 2 = / \xi-ti\w(ti,t2)dtidt2. 

J a 2 J a\ J x 2 J a\ 

Of course, for weights in which the solutions of (9J) are identical to those of (101 and ( ITT] ) then identification 
of the minimum point presents little challenge. For example if w is a product weight and symmetric about 
the midpoint ( ai 2 bl , ° 2 +° 2 ) then the minimum point is the midpoint. That is, if w(ti,t 2 ) — wi(ti)w 2 (t 2 ) 
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and Wi((a + 6)/2 — t) = Wi((a + 6)/2 + t) (i = 1,2), then it can be shown that the solution of ([9])— (TTT|) is 
the mid-point. This is the case when w = 1 and Corollary \4\ shows that the upper bound is minimized at 
Xi = (a* + bi)/2, i = 1,2. 

The major difficulty with (3) is that the upper bound is comprised of a linear combination of three terms 
involving norms of the partial derivative of the integrand. Hence it would be near impossible to find a global 
minimum that depends only on the weight and not /. To obtain a global minimum for a general second order 
rule will require either simplification of (3} or the derivation of another expression for the bound. The first 
point is dealt with in the corollary below, while the second is taken up in the next section. 

Corollary 6. Let f and w be as given in Theorem^ Then for all {x\ 1 X2) G [ai,6i] x [02,62] the following 
second order product double integral inequality holds 

fci /-&2 rbi rb2 

w(ti,t 2 )f(ti 1 t2)dt 2 dt 1 - f(xi,x 2 ) I I w(ti,t 2 ) dt 2 dti 

61 



a\ J a-2 

Of 



a\ J a-2 



dh 



(xi,z 2 ) / / w(t l ,t 2 ){x 1 -t 1 )dt 2 dt 1 + — (x 1 ,x 2 ) I I w(t 1 ,t 2 )(x 2 -t 2 )dt 2 dt 1 

ai J a~2 Ob2 J a\ J a^ 



< 



s 2 / 
dt\ 



(12) 



2 

d 2 f 
dt x dt 2 



X\ 



l w lli 



a\ + b\ 



2 



a 2 / 
dt\ 



Mli 
2 



x 2 



Xl 



«i + bi 
2 



h - 01 
2 



x 2 



a 2 +b 2 



a 2 + b 2 
2^ 
b 2 - a 2 



b 2 - a 2 
2 



rb\ [-hi 



where ||iu||i = J 1 J w{t\,t 2 ) dt 2 dt\ is the zero-th moment of the weight. 

Proof. The proof involves taking an upper bound of |3) using Holder's inequality. Thus, consider 

b\ rb2 rbi />&2 

/ w(ti,t 2 )(xi - tx) 2 dt 2 dti < sup (xi-ti) 2 / w(ti,t 2 )dt 2 dti 

a\ J a,2 tiG[ai,bi] J a\ J a2 

2 / l \2i 



max{(xi-ai) ,(xi-bi) }\\w\\i 

2 



(13) 

Similarly 
(14) 
Finally, 



Xi 



ai + b\ 



b\ - 01 



Hli- 



61 



w(h,t 2 )(x 2 -t 2 ) 2 dt 2 dt\ < 



x 2 



a 2 + b 2 



b 2 - a 2 



Iklli- 



w(ti : t 2 )\xi - ti\\x 2 - t 2 \ dt 2 dt\ 

< SUp \xi — tl\\x 2 — t2|||lf||l 

(tx ,t 2 )£[a 1 ,b 1 ]xla 2 ,b 2 ] 

= max{xi — 01, 61 — x\} max{i2 — a 2 , b 2 — X2}||w||i 



(15) 



Xl 



ai + bi 



61 -a x 



x 2 



a 2 + b 2 



b 2 - a 2 



^Hi- 



Making use of (p), p3j) and (p) gives (3) 



D 



It is clear that the bound in ( j!2| ) is minimized at the mid-point of the rectangular region. Unfortunately, 
the weight does not influence this minimum point. 

Taylor's theorem is a popular vehicle for developing cubature and higher dimension rules. Stroud [8] uses 
Taylor's expansion to develop cubature rules and recently Qi [5], used this technique to derive weighted Iyengar- 
type multiple integrals. The drawback is in the size of the error bound. For two dimensions, an n-th order rule 
has a Taylor remainder of n + 1 terms. Minimizing any rule with order greater than one would be extremely 
difficult. Thus, in the next section, we turn to the Peano kernel and use the results of \T[ \2\ to derive a second 
order weighted double integral inequality that contains only one term in the upper bound. 



3. Main Results 

Lemma 7. Let f : [ai, 61] x [a 2 , b 2 ] — > R be bounded and integrable and whose first partial derivatives exist and 
are also bounded and integrable. Furthermore, let w : (di,bi) x (02,62) — * (0, 00) be integrable. The following 
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identity holds 

rbi rb 2 

(16) 1= [f(xux 2 )-f(x l ,t 2 )~f(t 1 ,x 2 ) + f(t 1 ,t 2 )}w(t 1 ,t 2 )dt 2 dt 1 

J CLi J CL<2 



fOi rb-2 o2 f 



where x\ <G [01,61], x 2 <G [02,62] and 
(17) P(h,t 2 ) = 



(18) p{h,t 2 



rt 2 

/ p{ti,u 2 )du 2 , a 2 <t 2 < x 2l 

J a 2 

*2 

p{ti,u 2 )du 2 , x 2 <t 2 < b 2 , 
62 

ti 

t/;(ui,i2)dwi, ai < ii < xi, 
01 
*i 

w (ui, t 2 ) du\ 1 xi < ii < bi- 



Proof. To begin, let I = J 1 1 2 dt\ and consider I 2 where 

P{h,u 2 )du 2 — -— - — dt 2 + / / p(ti,u 2 )du 2 — — — — dt 2 

a 2 w« 2 / oti 5t 2 J X2 \Jb 2 J dhdt 2 

= hi + ^22- 

Using integration by parts, we find that 



hi = / P\ti,u 2 )du 2 — - / 

Jan Ot\ J a 



" rX2 df(h,t 2 ) 



„., ■>„., dti 



p(ti,t 2 )dt 



2 



Similarly 



Thus I 2 becomes 



«2 

p(ti,u 2 )du 2 — / — p{t u t 2 )dt 2 

a 2 at l Ja 2 0t l 

, f b2 ,. . , (df(t u x 2 ) df(t u t 2 ) \ , 

h 2 = P(tl,t 2 )[ =- XT dt 2 . 



■r 2 



b. 



V dh dh 



and substituting into / gives 

I = l I PitM—dt^^^ ^ P(t„t2)[—^ i ^-) dt 2dtl 

r " 2 f bl u * \ f df(h,x 2 ) df(h,t 2 ) \ 
(19) = I* I 3 dt 2 , 

J a 2 

where 



'-£«»-»{ ?t %r*-° J %r*) 



dh. 



Applying the same treatment to I3 as for I 2 gives 

rbi 
I 3 = / w(t 1 ,t 2 )[f{x 1 ,X 2 )-f{t U X2)-f(x U t 2 ) + f(tl > t 2 )]dt 1 . 



"1 
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Substituting 1% into ([19]) we find that the identity ([16]) is thus proved. 



D 



The upper bound of the integration rule will depend on P. Below, we detail some properties of P that will 
be subsequently used in analysis of the bound. 

Lemma 8. The kernel P : [ai, 61] x [a 2 , 62] -*I as defined in Lemma\7\ has the following properties: 

(1) P vanishes on the boundary of the rectangle [ai,6i] x [02,62]; 

(2) P(*i, •) : {a 2 , 62) — > R is monotonic increasing for all t\ € (ai, x\), 

(3) P(t\, ■) : (a 2 , 62) — > R *s monotonic decreasing for all *i £ (x\, bi), 

(4) P is positive on (ai,x\) x (02, #2) and (x\,bi) x (x 2 , b 2 ), 

(5) P is negative on (ai,x\) x (3:2,62) and (xi,b\) x (02, #2); 
for all (xi,x 2 ) G (ai,6i) x (02,62). 

Proof. These properties are quite simple to prove via inspection of the first partial derivatives of P. □ 

In Figure QQ we plot the surface and contours of ( [17] ) for two different weights. The plots exhibit the 
properties discussed in Lemma M It is obvious that the kernel achieves its maximum deviation on of its 
branches at the discontinuous point (xi,X 2 ). 

In the following theorem we state the main result by employing the identity in Lemma [7J to produce second 
order weighted double integral inequalities. In contrast with the inequalities of the previous section, the upper 
bound here is comprised of just one term. 

Theorem 9. Let the conditions of Lemma \7\ hold. The following double integral inequalities involving the 
usual Lebesgue norms of the first mixed partial derivative of f hold, 

0*10t 2 



(20) 



l/l < 



OC J O-l J &2 



\xi - *i| \x 2 - t 2 \w (t u t 2 )dt 1 dt 2 , 



if dt J t2 G Loo[ai,6i] x [02,62] and 

d 2 f 
(21) 



0*10*2 



x x px 2 pXx rb 2 

ina\<; / / w(ti,t 2 )dt 2 dti, / w(ti,t 2 )dt 2 dti 

ax J a 2 J ax J x 2 

6l pX 2 rbi pb 2 

\ w(ti,t2) dtzdti, / / w(ti,t 2 ) dt 2 dti 

x\ J a 2 J x\ J x 2 

r. 2 r 

if q-. L G Li[ai,6i] x [02,62], where L is defined in equation (16\). 

Proof. To prove (20) we begin with Holder's inequality and then simplify using Lemma \8\ 

J ax 
d 2 f 



< 



0*10*2 
2 / 



(22) 



f" 1 I" 2 P(t 1 ,t 2 ) d -l}^dt 1 dt 2 

ax Ja 2 5*10*2 

h rb 2 

/ \P(h,t 2 )\dt 2 dti 

x\ rx 2 

/ P(t 1 ,t 2 )dt 2 dh- 

OO \J Q-l J <L 2 

fci rx 2 

/ P{ti,t 2 )dt 2 dti 

X\ J a 2 



0*10*2 



fX\ rb 2 

I P(h,t2)dt 2 dh 

lax J x 2 

fb-, rb 2 

/ P(t u t 2 )dt 2 dti 

'x ± J x 2 

Now each of the terms in (|22|) can be evaluated via partial integration and simplified using Lemma [7J and 
equations ( [T7] ) and ( [181 . For the first term 

fX 2 \ 

(i 2 - x 2 )pdt 2 > dti 



\x 2 
\a 2 



[•xi rx 2 rii f 

\ \ P{t l7 t 2 )dt 2 dt L = / Ut 2 -X 2 )P 
J ax J a 2 J ax L 

rx 2 rxx 

= — / (*2 - x 2 )pdt\dt 2 

J a 2 J ax 



"2 
x 2 rxx 



(*2 ~X 2 )\ (*1 -Xl)p\ 



(*i — Xi)w dti > dt 2 



(23) =/ / (x 2 -t 2 )(x 1 -t 1 )w(t 1 ,t 2 )dt 1 dt 2 . 
Employing the same procedure for the other terms we find 

(24) / / P{t 1 ,t 2 )dt 2 dt 1 = / (x 2 - t 2 )(xi - h)w(ti,t 2 ) dtidt 2 , 
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(25) 



(26) 



bi f>x 2 rbi rx 2 

/ P(t l7 t 2 )dt 2 dti = / / (x 2 - t 2 )(xi - ti)w(ti,t 2 ) dt\dt 2 , 

Xl J a 2 J X\ J Cl2 

b± pb 2 pb\ pb 2 

/ P{t 1 ,t 2 )dt 2 dt 1 = / (x 2 -t 2 )(x 1 -t 1 )w(t 1 ,t 2 )dt 1 dt 2 . 

X± J X 2 J X\ J X 2 



Substituting (J23|)-(f26l) into ( 122] ) gives (|2Q|). To prove ( [21] ) we again begin with Holder's inequality 



i/i = 



< 



pb\ pb 2 
J a\ J a 2 


d 2 f 


dhdt 2 


d 2 f 


dt x dt 2 



Ot\Ot 2 



sup \P{hM) 

1 (*i,*2)e[ai,bi]x[a2,&2] 



fXx rx 2 rxx rb-2 

max< / / w(ti,t 2 )dt 2 dti, / / w{t\,t 2 ) dt 2 dt\, 

1 wai J ci2 J a\ J X2 

fb\ i-X2 rbi i-b 2 

/ w(ti,t 2 )dt 2 dti, / w(ti,t 2 ) dt 2 dti 

I Xl J a<2 J X\ J X2 

The last line being computed by appealing to the properties of P as listed in Lemma [8} Thus the theorem is 
proved. □ 



(27) 



If the first moments of the weight w are known, as well as the one dimensional integrals 

(28) 



hi / rb 2 \ rb 2 I pbx \ 

f(tux 2 ) I / w{ti,t 2 )dt 2 \ dt x and / f(xi,t 2 ) I / w(t ll t 2 )dt 1 \ dt 2 

ai \Ja 2 ) J a 2 \J ai J 

then (201 ) can form the basis of a cubature formula for the evaluation of the weighted double integral 
IId /(^1j t 2 )w{ti, t 2 ) dA over a rectangular region D. A major drawback is that in most cases the integrals 
(281) are unknown. These can be eliminated using the one-dimensional weighted results in [6]. Roumeliotis et 
al. [6] showed that for mappings / with bounded second derivative that 

,.b 



(29) 



b nb nb 

w(t)f(t)dt-f(x) / w{t)dt + f'(x) / (x~t)w(t)dt 



< 



If II 



{x-t) 2 w{t)dt, 



where x € (a, b) and w is a weight function. Thus making use of ([291, the following inequalities hold 



(30) 

and 
(31) 



f(h,x 2 )l w(ti,t 2 )dt 2 J dti - f{xi,x 2 ) J / witi^t^dhdh 

cii \J a 2 J J a\ J a 2 

bi rb 2 



ft f pb\ pb 2 

— ($i,x 2 ) J J (x± - t^wih.t^d^dti 



a\ J a 2 



< 



a 2 / 
dt\ 



oo J a\ J a 2 



w(ti,t 2 ) dt 2 dti 



b 2 I rbx \ rbx rb 2 

f{xi,t 2 )\ I w{tx,t 2 )dti ) dt 2 - f(xi,x 2 ) / / w(ti,t 2 )dt 2 dti 



at (-fci (•''2 

+ -kt(xi,x 2 )I I {x 2 -t 2 )w{t 1 ,t 2 )dt 2 dt 1 



< 



d 2 f 



dt\ 



w{t u t 2 ) dt 2 dh. 



oo J ai J a.2 



J a± J a,2 

It is of interest to note that combining ([20]), (301 ) and ( [31] ) will produce (3). Thus, in one sense, (201 ) is more 
general than (3]) since it is not obvious how one may derive (201 ) from (3J. 

One advantage of (201 ) over (3) is that the upper bound involves one term instead of three. Thus, with (201 
we can find points {x\,X 2 ) that will minimize upper bound in terms of the weight and independent of the 
integrand. In the following corollary we will identify points (xi,X2) to minimize the bound 

(32) J(xi,x 2 )= / \xi-ti\\x2-t a \w(ti,ta)dt2dti. 

J (l\ J (12 

Corollary 10. Sf{xi,x 2 ) as defined in (3M) is minimized at (x\,x 2 ) where x\ and x 2 satisfy the equations 

rx{ rb 2 /•&! rb 2 

(33) / / \x* 2 -t 2 \w(t u t 2 ) dt 2 dh= / \x* 2 -t 2 \w(ti,t 2 ) dt 2 dh 



and 
(34) 



x 2 /"^l 



\x\ - ti\w{ti,t 2 ) dt 2 dti 



bi 



|x* - ti\w(ti,t 2 ) dt 2 dti 
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(35) 



(36) 



J 



(i) 



dj_ 



(x\,x 2 ) 



ox 2 



Proof. Evaluating the partial derivatives of J gives 

r xi pb 2 t-bi pb 2 

/ |*2 - t 2 \w(ti ,t 2 )dt 2 dti - / / \x 2 -t 2 \w(ti,t 2 )dt 2 dti, 

i a\ J a 2 J x\ J az 

r x 2 t-bi pb 2 rbi 

/ \x\ — t\\w{t\,t 2 ) dt\dt 2 — j j \x\ — t\\w{t\,t 2 ) dt\dt 2 . 

' a 2 J a\ J x 2 J a\ 

Inspection of (35] ) reveals that, for fixed x 2 , J^ is monotonic increasing and J^(ai, x 2 ) — —J^{b\,x 2 ) < 0. 
J^ also exhibits similar properties and hence there exists a unique point (x*,x 2 ) that is the zero of ( 1351 ) and 
(36) and minimizes J. D 

The behaviour of ( 1321 ) is very dependant on the behaviour of the weight. In Figure [2] contours of J are 
plotted for different weight functions. In each case, the minimum point is readily observed and its location 
depends on the weight and weight null-space. 

In the following section, properties of the minimum point of J are identified for various conditions on w. 

4. Minimizing the bound 

Solution of equations (331) and (34) provide the point that minimizes the bound (32). The equations are 
non-linear and two dimensional, thus, in most cases, require numerical treatment. In this section we identify 
solutions or simplifications to (33) and (34) for specific weight types. Some of these weights are of importance 
since they appear in the important areas of integral transforms and integral equations. 

With functions of two or more variables it is common that an identifiable relationship between the variables 
is observed. That is, w(ti,t 2 ) = u>(<^(ii,£ 2 )) for some 4>. For singular weights, the null-space of </>, {(ti,t 2 ) : 
0(^1^2) — 0}: may be of interest since this may furnish the singularity structure of the integral. Below, we 
explore the properties of J for <p being the difference mapping on a square and generalise to more general 
null-spaces in other corollaries. 

Corollary 11 (Difference weight). Let w : (a, b) — > (0, 00) be integrable and let a < x±,x 2 < b. Then the 
bound 

/>b />b 

J(xx,x 2 )= / \xi-ti\\x 2 -t 2 \w\ti-t 2 \dt 2 dti. 

J a J a 

is minimized at the midpoint X\ = x 2 = s ^ ■ 

Proof. As stated in Corollary [101 J is minimized at the root of equations (33) and ([34) . Substituting the 
midpoint in (33) gives 

-(a+b)/2 r b 



(a+b)/2 J a 



~-t 2 

a + b 



wlti — t 2 \ dt 2 dti 



w\u — v\ dvdu - 



(a+b)/2 J a 
b 



(a+b)/ 2 J a 



2 

a + b 



t- 



w\t\ — t 2 \ dt 2 dt\ 
w\t\ — t 2 \ dt 2 dt\ 



0. 



where u . = a + b — t\ and v = a + b — t 2 are integral substitutions. The same treatment on (34) shows that the 
midpoint minimizes the bound □ 

The following two corollaries show that the simultaneous equations (33) and (34) may be decoupled under 
certain conditions for the weight. 

Corollary 12 (Separable weight). Let the conditions in Corollary [W\ hold. Furthermore, let w be separable, 
that is w(ti,t 2 ) = w\{ti)w 2 {t 2 ), where Wi are themselves weight functions defined on [cii, bi\, i—1,2. Then J 
is minimized at the median of each weight 

rb, 
w l (t l )dt i = Wi(ti)dU, i = l,2. 

J Xi 

Proof. Substituting w(ti,t 2 ) — wi(ti)w 2 (t 2 ) into (33) and (34) and simplifying produces the result. □ 

Corollary 13 (Symmetric weight). Let the conditions in Corollary [W\ hold and let w : (a, b) x (a, b) — > K be 
symmetric, that is, w(ti,t 2 ) = w(t 2 ,ti). Then the minimum point is at x\ = x 2 . 

Proof. With the above conditions, the two equations in Corollary \W\ are 



(37) 
(38) 



dj_ 

dx\ 

dj_ 

dx 2 



{xi,x 2 ) 



(xi,x 2 ) 



\x 2 - t 2 \w(ti,t 2 )dt 2 dti 



a J a 
x 2 rb 



\x 2 - t 2 \w(ti,t 2 )dt 2 dti 



x\ J a 
b r b 



rx 2 po po po 

I j \xi - ti\w(ti,t 2 ) dt x dt 2 - j I \x\-t 



\w(ti,t 2 ) dtidt 2 



Beginning with ( |37|) we have 
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Xt rb rb rb 



- — {x!,x 2 )= J J \x 2 -t 2 \w(ti,t 2 )dt 2 dti- J J \x2-t 2 \w(t 1 ,t2)dt 2 dt 1 



a J a J x\ J a 

X\ rb rb rb 



rX\ ro ro rO 

/ / \x 2 -t 1 \w(t 2 ,ti)dtidt 2 - / / \x2 — h\w(t2,ti)dtidt2 

J a J a J x\ J a 

rX\ rb rb rb 

I I \x2~ti\w(ti 1 t 2 )dt 1 dt 2 - / / \x 2 — ti\w(ti,t 2 )dtidt 2 

J a J a J x^ J a 



Thus the solution of 



is identical to 



dj . 

OX 2 



dJ I An a dJ ( A n 

- — (a:i,3!2j=0 and - — {Xi,x 2 )=0, 

OX 1 OX2 



dJ l An a dJ ( An 

- — {x 2 ,xi) = and - — {Xi,X 2 )=0, 

ox 2 0x2 



and hence the solution occurs at X\ = x 2 . □ 

In Corollary \\T\ we showed that if a weight has a "difference" null-space on a square then the bound ( J32| ) is 
minimized at the centre of the square. The following corollary will generalise this result and we will consider 
a null space of the form t\ — 4>(t 2 ) where <j) is anti-symmetric on a rectangle. 

Corollary 14. Let w : (—a, a) x (— A, A) — ► (0, 00) be a weight function of the form w{ti, £2) = w\ti — <p(t 2 )\, 
where <f> : (— A, A) — > (—a, a) is surjective and odd, for some a, A > 0, that is </>(—£) = —<fi(t). Then J as 
defined in (SMj) is minimized at the origin. 

Proof. We need to show that 

/0 pA pa pA 

\ \h\w\tx-<t>{t2)\dt 2 dtx= / \t 2 \w\t 1 -0(t2)\dt 2 dt 1 
-a J -A JO J -A 

and 

/0 pa pA pa 

/ \tl\w\tl-<l>fo)\dtldt2= / \t 1 \w\t 1 -4>{t2)\dt 1 dt2. 

-A J -a JO J -a 

Making the substitution t\ — —u and t 2 = —v in the first integral of ( [39] ) we have 

c-A r-a c-A 

I \t2\w\t1 — <fr(t 2 )\ dt 2 db\ = / / \v\w\u — (f>(v)\ dvdu. 

-a J -A Jo J -A 

Similarly 

r0 pa pA pa 

/ \ti\w\t\ — 4>(t 2 )\ dtidt 2 = I I \u\w\u — 4>{v)\ dudv. 

-A J -a Jo J -a 

Hence, the corollary is proved. □ 

5. CUBATURE AND GRID GENERATION 

Theorem \9\ can form the basis of a cubature formula for weighted double integrals. That is, we can form a 
mesh and apply equation ( [201 ) to each grid rectangle. The minimum point of each rectangle would be given 
by ( [331 and (134|). The question that would remain is how would such a grid be "optimally" constructed? For 
example, for four grid rectangles, as shown in Figure \S[ how would £1 and £2 be chosen? 

Let us consider a partition aj < £j < bi of the interval [ai,bi], with Xi t i G [oj,£i] and Xi >2 € [£i,frj], for 
i = 1,2. In addition, define D to be the rectangular region [ai,bi] x [02,62] and define the sub-regions 
£>i,i = [ai,£i] x [02,6]) °i,2 = [£i,&i] x [02,6], Ai,i = [aij£i] x [£2,62] andD 2 ,2 = [£1,61] X [6,^2]- A sketch 
of this partition is shown is Figure SI 
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Theorem 15. Let the conditions in Theorem\9\ hold. Given the partition defined above, the following double 
integral inequality holds 



(41) 



(J f(t 1 ,t 2 )w(t 1 ,t 2 )dt 1 dh-J2(ll 



;+£>2,i 



f(xi l i,t 2 )w(ti,t 2 )dtidt2 



£>i,l+-D»,2 



f(t 1 ,x2,i)w(t 1 ,t2)dt 1 dt 2 ) +y^y^/(a; 1 ,j,a;2,i) // w(t 1 ,t 2 )dt 1 dt 2 



< 



d 2 f 



dt 1 dt 2 



i=X j=l 

2 2 

X!X! // l x M — *l||i>72,j -t 2 \w(t 1 ,t2)dt 1 dt 2 . 

1=1 j=l • 



The bound is minimized at the points Xij,£i, (i,j — 1,2J satisfying 



(42) 



(43) 



(44) 



(45) 



(46) 



\ \x 2! i — t 2 \w(ti,t 2 )dt 2 dh+ / \x 2>2 — t 2 \w(ti,t 2 )dt 2 dti 

a\ J a.2 J a\ •* £,2 

fl /"?2 (-Cl (-62 

/ \X 2 ,1 -t 2 \w(ti,t 2 )dt 2 dti + / / |x 2 ,2 — *2|w(*l)*2)dt2d*l, 

xi, 1 ^02 Jn>i,i J £,2 

a;i,2 /-?2 fii,2 /-62 

/ \X 2> 1 - t 2 \w(t-L,t 2 ) dt 2 dt! + / |x 2 ,2 - t 2 |w(t 1; t 2 )^2^1 

?1 ^a 2 J?l ^£2 

bi /"?2 rbi rb 2 

/ \x 2 ,i —t 2 \w(ti,t 2 )dt 2 dti+ / |a;2,2 - ^|w(ti,t 2 )(it2^i, 

£1,2 ^02 Jz)l x 2 J& 

:E2,1 /"Cl /" a; 2,l /■&! 

/ \xi,i -ti\w(ti,t 2 )dttdt 2 + / |ari,2 — ti|io(ti,t2)dtidt2 

02 «/ai J (12 J £1 

£2 r£i r£,2 rbi 

/ \xi.i -ti\w(t ll t 2 )dt 1 dt 2 + / 1x1,2 -*i| w(ti,t 2 )dtx dt 2 , 

X2,\ Ja\ J x 2 ,i J £1 

2:2,2 p£l f x 2,2 pbi 

/ \xi : i ~t 1 \w(t 1 ,t 2 )dt 1 dt 2 + / |jci,2 -ti\w(ti,t 2 )dtidt 2 

?2 Joi ^?2 •'fl 

b 2 r£i rb 2 rb\ 

\ \xi,i -ti\w(ti,t 2 )dtidt 2 + / \xi t2 -ti\w(ti,t 2 )dtidt 2 , 

2:2,2 «'ai ■'2:2,2 •'Ci 

Xi,i + xi, 2 , ,. X 2 ,l + X 2 , 2 
6 = o ~ and ?2 = — S • 



Proof. To obtain (]4T), it is a simple matter of applying equation (20| ) of Theorem 0] to each region Djj 
(i,j = 1,2), summing and finally employing the triangle inequality. 

To show equations pZ)— ([36), we calculate the stationary point of the bound 



(47) 
For in, 



i=i j=i 



J r = J^^ // |zi.j -ii||x 2 ,j -t2\w(t ll t 2 )dt 1 dt2- 



^ = ^{llJI Dij l*M -*ilKi -faK*i,fa) Axdfe} 



9xi, 



2:1,1 /-?2 



1 V* ./«! </ Q2 

?i /■£= 



(»1,1 -*l)|X2,l - t 2 \w(ti,t2) dt 2 dt 1 



+ / (ti-«i,i)|a;2,i-*2|«j(ti,t2)dt2dti 



2:i,l •'Os 

2:1,1 /-b2 



+ / / (xi,l - tl) |*2,2 ~ t2\w(t 1 ,t 2 )dt 2 dti 



ai J ^2 
?i z-62 



2:1,1 •/£: 



+ / / (£1 -xi,i)|x 2 ,2 -t 2 \w(ti,t2)dt 2 dt- l 
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= / |xa,i - t 2 \w(tx,t 2 ) dt 2 dti - / / |a?2,i - t 2 \w(ti,t 2 )dt 2 dti 

/•^l.l pbz t>£i pb2 

+ 1 / |^2,2 — t 2 \w(tl,t 2 ) dt 2 dtl — / / 1*2,2 — *2|«'(il,i2)rfi2^1- 

J ax J £,2 Jxi t i J £2 

Setting the last expression to zero gives ( [42] ) and the same process can be used to show equations (j43|)-([45 
To show ( 36) , observe that 



■kt- = / {£i-xi,i)\x 2 ,i-t 2 \w(£i,t 2 )dt 2 - / (#1,2 - £i)| #2,1 -h\w(£i,t 2 )dt 2 

Ct.1 Ja 2 Ja 2 

/•i>2 /"&2 

+ / (£i -#i,i)|#2,2 -t 2 \w(^!,t 2 ) dt 2 - / (xi t2 -^i)\x 2t i-t 2 \w(^ 1 ,t 2 )dt 2 

J& -l(,2 

«2 

2 



2^ 2 (gl - a:i ' 1 ^' 2 ) 1^2,1 -t a lw(ei,t2)dt a 

+ 2|'' a ^ 1 - a:M + a:i - 2 )|a; 2 , 2 -t 2 K£ 1 ,t 2 )dt 2 , 



'6 

which obviously has a root at {46) -i. Similarly, we can show (H6) 2 . D 

We now proceed to a full weighted cubature formulae. 
Define the following partitions of the intervals [a,, 6»] 

h '■ Chi = Si,0 < £,1 — ' ' ' — Si,« = &*) 

and let a^j G [&,j-i,£t,j] for i = 1,2 and j = 1,2, . . . ,n. Furthermore, let D h3 = [£i,j-i,£i,i] x [£2,3-1, £2,3], 

A (1) = ULi A,* and £>f } = ULi Ay, fOT «,i = 1,2, . . . , n. 
Consider the weighted cubature formula 

(48) A(f,w,h,I 2 ,£,x) 

= y^( // ! f( x i,i^2)w(t 1 ,t 2 )dt 1 dt 2 + // ^ f(ti,x 2 ,i)w(ti,t 2 )dhdt 2 J 



n n 



-535Z/(a:i,i,X2,j) // w^t-i) dtidt 2 . 

t=l 3=1 JJDi,i 

Using the above assumptions, we can write the following theorem. 

Theorem 16. Lei / : [ai,6i] X [a 2 ,6 2 ] — ► R a?i(i it; : (ai,6i) x (a 2 ,6 2 ) — > (0,oo) oe as m Theorem^ and 
I\i d 2 i £, * & e given above. The following weighted cubature formula holds 

i>b\ t>b2 

(49) / / f(t 1 ,t 2 )w(t 1 ,t 2 )dt 2 dt 1 = A(f,w,I 1 ,I 2 ,£,x) + R(f,w,I l ,I 2 ,£,x), 

where 

d 2 f 



(50) \R(f,w,h,I 2 ,Z,x)\< 



dhdt 2 



oo i=1 j- =1 ^ ■'-Dj 



5ZX! // kl,J-*l||#2,j -<2|w(*li*2)dtldt2. 



TTie bound (5U \) is minimized when x and ^satisfy 

( 51 ) zJ / / \ x 2,j -h\w(t 1 ,t 2 )dt 2 dti = 22 / 1*2,3 - t^w^,^) dt 2 dti 

j=l J£l,i-1 J &.3-1 3 = 1 •'^l.i •'$2,3-1 

" /-^.i r?i,3 n r&.i /-?i,j 

(52) 23/ / \x 1 , j -ti\w(t 1 ,t 2 )dt 1 dt 2 = 22 / 1*1,3 -tilto^i.tajdtidta 

3=1 •'61,1-1 •'£1,3-1 3=1 •'^2,. •/ £1,3-1 

(53) £k,i = — — 2 M+1 , /or i = l,...,n, £ = l,...,n- 1, fc = 1,2. 

Proof. The proof follows that of Theorem [LSI D 

To find the An — 2 unknowns 

£j,l < 6,1 < %i,2 <•• < &,n-l < Xi <n , 

for i = 1, 2, we need to solve the An — 2 coupled non-linear equations (15T), ( 152) and (153). These equations are 
easily solved iteratively with a uniform grid as the starting point. With this method of solution all variables 
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are fixed apart from the parameter of interest. Thus for example if k = 1 and we fix i, then equation ( 151 j) may 
be considered as a function of x\.i only; say F(x\^). It is easy to see that 

F'(x l>i ) = 2j2 / \x 2 , J -t 2 \w(x 1 . z ,t 2 )dt 2 >0 

and F(£ii_i) < 0, F(^ii) > 0. Thus F has a unique root and the bisection algorithm would be an appropriate 
numerical technique to produce the solution. 

In Figures 31 [51 El and \7\ the grid obtained via numerical solution of ([51])— ([53]) is plotted for various weight 
functions and n. We can see that the grid clustering reflects the weight behaviour. 
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Figure 1 . Surface and contour plots of the Peano type kernels P defined in ( [17] ) for different 
weights, (a) w(ti,t 2 ) = — ln(iii2) over the unit square and x\ = x 2 — 0.5, (b) w(ti,t 2 ) = 
\/ti/t 2 over the unit square and x\ = x 2 = 0.5. 
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Figure 2. Contour plots of the J{x\,X 2 ) given by ( [32] ) for various weight functions, (a) 
w{h,t 2 ) = -\a(tih), (ii,i 2 ) e (0,1) x (0,1), (b) w{t u t 2 ) = -ln|ti-t 2 |, (t u t 2 ) € 
(0,1) x (0,1), (c) w(h,t 2 ) = -]n|t a - t||, (*i,t 2 ) G (0,1) X (0,1) and (d) u;(ti,t 2 ) = 
e- fl /Vh, (ti,t 2 )G (0,4) x (0,1). 
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Figure 3. A partition of the rectangular region D = [ai, 6i] x [a 2 , 6 2 ] showing the sub-regions 
A,j, ijj = 1,2. 
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Figure 4. Grid generated from the solution of equations ([51])- ([53] ) for the weight u>(ti, £2) = 
\A2A1 over [0, 1] x [0, 1] and n = 10. The solid lines indicate the composite grid; in each grid 
square there is one function evaluation (dot) and two single integral evaluations (dashed lines). 
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Figure 5. Grid generated from the solution of equations (51)- ( [53] ) for the weight w(ti,t2) = 
— In(tit2) over [0, 1] x [0, 1] and n = 10. The solid lines indicate the composite grid; in each 
grid square there is one function evaluation (dot) and two single integral evaluations (dashed 
lines) . 
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Figure 6. Grid generated from the solution of equations ([51])- (531 ) for the weight w(ti,t 2 ) = 
— In(iii2) over [0, 1] x [0, 1] and n = 30. The solid lines indicate the composite grid; in each 
grid square there is one function evaluation (dot) and two single integral evaluations (dashed 
lines) . 
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Abstract 

Let {p n (x)}%Lo be a sequence of polynomials of degree n, defined by a special 
class of generating function generalizing the generating function of the Laguerre 
polynomilas, by using the factorization method, introduced in [1], we determine 
their differential equations, and some other properties. 
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Introduction 



In recent articles [4], [5], [8], different versions of the factorization method [1] have 
been used in order to investigate properties of some recurrent polynomials {p n (x)}'^ ) =Q , 
where p n (x) is a polynomial of degree n. 

G. Dattoli et al. (see [2], [3]) introduced the so called monomiality principle as 
an instrument useful to study in an unified way a large class of special functions, in- 
cluding the multivariable Bessel functions, Hermite and Laguerre polynomials, classical 
Bernoulli and Euler polynomials and many others. 

A family of special polynomials is called quasi monomial if there exist two operators 
P and M, satisfying the commutation property: [P, M] := PM — MP = 1, and such 
that: 

P(p n (x))=np n - 1 (x), M(p n (x))=p n+ i{x). (1.1) 
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This definition can be extended to some class of special functions, including the 
multi-variable or multi-indices cases. 

From equation (1.1) some important properties of the polynomial family can be 
easily deduced, such as: 



the differential equation: 

the explicit form: 

the generating function: 



MP(p n (x)) = np n (x) 
p n (x) = M n (l), 



OO J?l 

e tM(1) = E h p»W- 

71=0 "'■ 



A more classical approach, connected with the work of L. Infeld and T.E. Hull [1], 
has been considered by M.X. He and P.E. Ricci [4], [5]. They define two operators L~ 
and L+, depending on n, satisfying the properties: 

L nPn(x) =p n - 1 (x), L+p n (x) =p n+1 (x), (1.2) 

and consequently 

( L n+l L n)Pn{x) =Pn(x), (L+^L^p^x) = p n (x), (1.3) 

(L n _ k+1 L n _ k+2 ' ' ' L n _iL n )p n (x) = p n _ k (x), (1.4) 

(L+_,L+_ 2 • • • Lm)p (x) = p h (x). (1.5) 

Equations (1.3) give back the differential equation satisfied by the polynomials, and 
furthermore the formulas (1.4)-(1.5) permit to recover the operator L+ (L~) when the 
operator L~ (L+) together with a linear recurrence relation of any order satisfied by 
the polynomials p n (x) is known. 

It is worth to note that, the knowledge of the exponential generating function 



CO 



G(s,i):=5> n (a0- (1.6) 

^o nl 

of the given family of polynomials often permits the construction of the operators 
(1.2). A very useful survey about different classes of generating functions is given in 
the classical book of H.M. Srivastava and H.L. Manocha [6]. 

In recent papers, the above methods have been used in order to derive the differen- 
tial equations satisfied by polynomials belonging to the Appell family [4], [5], [7] and 
some particular case of rf-orthogonal polynomials (see [9], [10], [11]). 
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Appell polynomials are characterized by the generating function 

Y / - ] Pn(x) = A(t)e* t , p (x) = l, (1.7) 

with A(t) being a function of t without singularities at t — 0. The main characteristic 
of A(t) is that 

-i-Pnix) = np n -i{x) (1.8) 

dx 

according to which the ordinary derivative can be viewed as a negative shift operator 

in the discrete index n. 

It has been recently emphasized that polynomials characterized by the generating 

function [7], [12], [13] 

oo ±n 

G(t, x\0) = ]T -Ux) = B(t)C (xt) (1.9) 

with B(t) playing the same role of A(t) and 

n (—lY r r 
r=o (r\) 
being the th order Tricomi function, are characterized by the property [2] 

-— x— l n (x) = nl n -i(x) (1.11) 

which shares some analogy with the properties of Appell polynomials. Such analogy is 
reinforced by the fact that the operator 

- d d , 

v *'-=-Tx x Tx (L12) 

is recognized as a Laguerre derivative within the context of the monomiality principle 
applied to Laguerre polynomials [2], [13]. 

Since for B(t) = e* the equation (1.9) gives back a family of Laguerre-type polyno- 
mials, we will denote by l n (x) the relevant polynomials. 

As a further example we consider the case 

A(t) = C m (t) 

_ - (-iyx r (i-i3) 

where C m (t) is the m th order Tricomi function, eigenfunction of the operator 

d 2 , ,d , , 

V t , m = -t^ 2 -(m + l) Jt (1.14) 
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In this paper we address the problem of obtaining general conditions on the func- 
tion B(t) to derive the differential equations specifying l n (x). A first effort to derive 
differential equations for the Laguerre type family has been put forward in ref. [7], 
here we will develop a more systematic strategy also employing the methods of refs. 

[4]-[5]. 

2 Laguerre type polynomials and relevant differential equa- 
tions 

We have already remarked that the th order Tricomi function is an eigenfunction of 
the Laguerre derivative and that the m th order functions are eigenfunctions of the 
operator defined by equation (1.5). We will therefore derive the differential equations 
for families of Laguerre type polynomials by just taking advantage from this fact and 
consider as introductory example the generating function 

G(t;x\m) = f:t%^ = ^^, (2.1) 

with 

4 m) (*) = E }~ X) \r ( 2 - 2 ) 

By applying the operator (1.5) to both sides of 

{l-t)G(x,t\m)=C m (xt), (2.3) 

we find for the generating function 

d 2 dG 

t(l-t)—G+ [(m - l)t - (m + 1)] — + [m + 1 + x(l -t)]G = 0, (2.4) 

which provides the recursion 

*n+l( a; ) = 7 1 xn t-n-l\ X ) J t~ 

n — m — 1 L 
+ [n(n - 1) - n(m -l)-(m + l)-x] 4 m) 0*0} • 



(2.5) 



In this case the second recursion is provided by (see equation (1.5)) and the con- 
cluding section) 

V x , m l^\x)=nlt\(x) (2.6) 

so that we can write the differential equation for the /^(a;) polynomials in the form 

xVl m l^\x) + [( n -l)(n-2)-(n-l)(m-l)+ 

+ m-l-x]T) x , m l^ix) =n(n-m-2)l ( ™\x), 
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to this aim it is worth noting that the Laguerre derivative satisfies the relations [2], [13] 

v ' dx n dx n 

n+k (2-8) 



The method we have just outlined can be generalized, to this aim we consider the 
generating function 



t 



F(x,t\m) = £ -1 ^\ x ) = B(t)C m (xt), 



n=0 



n 



and on account of the identity 



dx s 



c n {x) = (-iyc n+s (x) 



it is shown that the polynomials \^ n \x) satisfy the recurrence 

d 



dx 



\^\x) = -n\tt 1] (x). 



(2.9) 



(2.10) 



(2.11) 



Limiting ourselves to the case when m = 0, we find that the generating function 
satisfies the equation 



-t-T(x,t\0) = [x + S o (t) + S 1 (t)]V(x,t\0) + 



where 



S (t) 



+ 2xtS (t)T(x,t\l) 

1 dB(t) 1 d"B(t) 

-, Jiyt) — —- 



B(t) dt ' " lv " y B{t) dt 2 ' 

By assuming that 5o,i(t) are known through their series expansion 



<M*) = E 



r=0 



j, i 



(2.12) 



(2.13) 



(2.14) 



we obtain for the A^ -* (x) the following recursion involving the A^ (x) 

\ ( nli{x) = ~ \xX { n ] {x) + 4 0) (x) + n£\x) + 2xn ln ^{x) 



n 



!L a (o,i) 



™ — n ' • 



r=0 



(2.15) 



in{x) = E^r^^) 



X 



r=0 
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which allows the derivation of a rather involuted differential equation satisfied by the 

Ai 0) (*). 

Simpler results can be obtained by using the relation 

T(x,t\0)^=C (xt), (2.16) 

in this case assuming known the expansion 

1 OO J.S 

w = &* (2 - 17) 



we end up with the recursion 

q n +i(x) = — q n , 

n + 1 

(2.18) 



which has the advantage of not involving the polynomials X^(x) and yields a less 
involved differential equation. 

Further comments will be presented in the concluding section. 

3 Concluding remarks 

In this paper we have considered Appell and Laguerre type polynomials. Let us remark 
that, in general, if 

A(*) = £°f, B(t) = E^ (3-1) 

we find 

Pn(x) = Yl ( „ )an~rX T , 

(3.2) 



r=0 \ r / 



Ux) = ±h b -^x r 



r=0 \ r J rl 



Furthermore if A(t) and B(t) possess the Fourier transforms A(a) and B(a), we find 
that the polynomials (3.2) are specified by the integral transforms 

\ r+oo „ 

Pn( x ) = — / A(a)(x + ia) n da, 

v27T J — oo 

(3.3) 
l n (x) = —7= / B(cr)L n (x,ia)da, 
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where L n (x,y) are the two variable Laguerre polynomials (see ref. [13], [14]), namely 

L n (x,y) = n\Y: / } l n2 (3.4) 

^ (n -r)\{r\y 

In a forthcoming investigation we will analyze the case of multi-index polynomials 
and show how the techniques of this paper can be suitable extended to obtain the 
differential equations specifying multi-index and multi-variable Appell and Laguerre 
polynomials. 
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Abstract 

We establish an asymptotic formula for general sequences of positive 
linear operators acting on spaces of continuous functions denned on 
convex subsets of (possibly infinite-dimensional) Banach spaces. 

Moreover we deduce analogous formulae for the Bernstein-Schnabl 
operators on bounded convex subsets of separable Banach spaces, as well 
as for the multidimensional extension of the so-called Szasz-Mirakjan 
operators. 
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1 Introduction 

In the recent paper [2], the authors have determined some general conditions 
under which a sequence (£ n ) n >i of positive linear operators acting on a func- 
tion space E satisfies an asymptotic formula of the type 

lim \\n(L n (f) - f) - A(f) || = (/ e D(A)) 
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with respect to a weighted norm on E, where A is a second order differen- 
tial operator defined on a suitable subspace D(A) of E of twice continuously 
differentiable functions. 

The space E involved in the previous formula is a function space on some 
(possibly unbounded) real interval. 

In this Note we treat of the more general case in which E is a space of 
real-valued functions defined on a convex subset K of an arbitrary Banach 
space. 

In such a general setting, the second order differential operator is replaced 
by an "abstract differential" operator of the form 

A{f){x) := l 2 T{f{x)){x) + f'(x)((3(x)) + 7 (aO/(aO, 

where f is a sufficiently smooth function on K , T an operator acting on the 
space of all bilinear continuous forms on X, j3 a mapping from K into X, 7 a 
real- valued function on K and f'(x) and f"(x) denote the first and the second 
Frechet derivatives of / at x E K . 

Our main results established in Section 3, besides having an intrinsic in- 
terest of their own, allow us to obtain some asymptotic formulae for several 
sequences of positive linear operators as, for instance, the Bernstein-Schnabl 
operators, defined by 

B n (f)(x) := / ••• / /(— -)dj2 x (xi)---dfi x (x n ), 

Jk jk V n / 

where if is a bounded convex subset of a separable Banach space, (fJ, x )xeK 
is a family of probability Borel measures on K, and / belongs to a suitable 
function space on K. 

In this special case, our results generalize those of [1] and [8]. 

We also discuss in detail the finite dimensional case extending, among 
other things, some results of [9]. As an application, we establish an asymp- 
totic formula for the multidimensional extension of Szasz-Mirakjan operators 
introduced in [10]. 



2 Notation and preliminary results 

Let (X, || • ||) be a Banach space and K a convex subset of X . For all x G X 
we shall consider the following mappings 

ip x :yeK^y-xeX; (2.1) 

^:yeK^(y-x,y-x)eX 2 ; (2.2) 

d x :y eKt->\\y-x\\e R. (2.3) 
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Moreover, the symbols .F(X,R), F(K,X), X' and C (2) (X,R) will stand re- 
spectively for the space of all real functions defined on X, the space of all 
mappings from K into X, the space of all linear and continuous forms on X 
and, finally, the space of all bilinear and continuous forms on X. 

The spaces X' and £( 2 \X, R) will be equipped with the natural norms 

|| w || ;= \\u\\ X ' ■= sup{\u(x)\ : x e X, \\x\\ < 1} (u E X') (2.4) 

and 

INI := II^IL(2)(x,R) : = snp{\v(x,y)\ : x,y E X, \\x\\ < 1, \\y\\ < 1} (2.5) 

(t)G£( 2 »(I,R)). 

In the sequel we shall also consider the linear subspace of T{K, X) 

S(K,X) := span({^ x : x E K}) (2.6) 

spanned by the set {ip x : x E K}. 

Let us now introduce the function spaces in which the next theorems are 
mainly set. 

By C(K, R) we denote the space of all continuous real- valued functions 
defined on K; moreover, C 2 (K, R) stands indifferently for the space 

C {2) (K, R) : = {/ 6 C(K, R) : / is the restriction to K of a twice 

continuously differentiable function defined on an open 
neighborhood of K} 

if the interior Int(K) of K is empty, or for the space 

C 2 {K, R) : = {/ E C(K, R) : / is twice continuously differentiable on Int(K) 
and its derivatives admit continuous extensions up to K} 

if Int(K) ^ 0. Thus, in this second case, for any / E C 2 (K, R) by the symbols 
/' and /" we will mean the continuous extensions to K of the derivatives of f 
in Int(K). 

If X is finite dimensional, K is bounded and the boundary OK of K is 
sufficiently smooth, then C^(K,R) C C 2 (K,R). 

We also set: 

Ch(K,R) := {/ E C 2 (K,R) : sup \\f"(x)\\ cm{x>R) < +oo}, (2.7) 

and 

UC 2 b (K, R) := {/ E C 2 b (K, R) : /" is uniformly continuous on K}. (2.8) 
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Let E be a linear subspace of T{K, R) such that 

{/ G C(K,K) : sup l/( ,^ L < +00 } c E . (2.9) 

x€K 1+ \\X\\ 2 

By virtue of (2.9), for any ueX',v G £ (2) (X, R) and s G X, 

uo^eE and voijj^eE. (2.10) 

For every / G C 2 ^, R), x,y <E K set 

{0 if x = y, 

f(y)-f{x)-f'{x){y-x)-\r{x){y-x,y-x) • , / ( 2 - U ) 

II 119 I I Jb -f— U 

\\y—x\\ z J i a 

In what follows, unless otherwise stated, we shall consider a function / G 
C 2 b (K,R) and set 

M := sup xeK \\f"(x)\\ C (2 ){X:R) . 

First notice that necessarily / G E and u;/(s, -)d 2 G -E; indeed, if Int(K) ^ 0, 
pick Xo G Int(K); for any re G Int(K), by the Taylor's formula with integral 
remainder (see for instance [6], § 4.3.1) 

f(x) = f(xo)+f(x )(x-x )+ / (l-t)f"(x + t(x-xo))(x-x ,x-xo)dt, 

Jo 

and thus 

|/(x)| < |/(x )| + ||/'(xo)|| \\x-x || +y || a; - x || 2 ; (2.12) 

by continuity this last inequality may be extended to K and therefore, by 
(2.9), / G E. This, in turn, together with (2.10), implies that for any x G K 
u)f(x,-)d 2 x G E. By analogous reasonings one may deduce that / G E and 
Wf(x, -)d 2 x G E for any x E K also in the case Int(K) = 0. 

Moreover, it will be useful to notice that the function x G K \— > \\f'(x) || G R 
is itself an element of E, too; in fact, if Int(K) ^ we may choose Xq G Int(K) 
and then, as a consequence of a mean- value theorem ([6], Corollary 2.1), for 
any x G K 

\\f{x)\\ < \\f(xo) || +M ||s -so ||; (2.13) 

by continuity the same inequality holds for any x G K. One may argue 
analogously to reach the same conclusion if Int(K) = 0. 

Assume now Int(K) ^ 0; for every x,y G Int(K) by the definition (2.11) 
and by using the Taylor's formula with integral remainder we deduce that 

u f (x,y)d 2 x (y) = f (l-t)f"(x + t(y-x))(y-x,y-x)dt+ (2.14) 

•J 

\l - t)(f"(x + t(y - x)) - f"(x))(y -x,y- x)dt. 
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Notice that the function O0f(x, y)d 2 (y) is continuous with respect to both x and 
y, and therefore, by the continuity of /", and by the Lebesgue's dominated 
convergence theorem, the equality (2.14) holds also true for any x,y G K. 
The previous formula implies that 






| cu f (x, y) | d 2 x (y) <2M\\y-x\\ 2 f\l - t)dt = M\\y 

and therefore for any x,y G K 

\uj f (x,y)\<M. (2.15) 

Finally notice that if, in addition, /" is uniformly continuous, then 

\im Uf(x, y) = uniformly w.r.t. x G K. (2-16) 



y-,x 



Indeed, for a given e > 0, there exists S > such that \\f"( x ) ~ f"(y)\\ — £ 
for any x,y G K for which ||a; — y\\ < 5; therefore, for any such x and y, and 
by virtue of (2.14) one has 

-1 



e „ „ 2 



I u f (x, y) | d x (y) <e\\y-x\\ / (1 - t)dt = - \\ y - x 

Jo I 

whence it follows | cuf(x,y) \< |. 

3 Asymptotic formulae on convex subsets of 
Banach spaces. 

In this section we establish an asymptotic formula for an arbitrary sequence 
of positive linear operators. The particular case in which these operators are 
associated with a family of measures is considered as well. Finally we develop 
similar results in finite dimensional settings. 

As in the previous Section we shall consider a Banach space X, a convex 
subset K of X and a linear subspace E of J-"(X, R) satisfying (2.9). 

Theorem 3.1 Let (I/ n ) nS N be a sequence of positive linear operators from E 
into F(K, R), and consider a divergent sequence (v?(n)) nS N of positive integers. 
Given a mapping 

T:£ (2) (X,R) ->F(K,R), 

w, 7 G F(K, R) and /3 G ^(K, X), consider the operator A : C 2 (K,R) i— > 
T{K, R) defined by setting for any f G C 2 (K, R) and x G K 

A(f)(x) := \T{f'{x)){x) + f(x)((3(x)) + i(x)f(x) (3.1) 

and assume that 
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(%) for k=0,2 

lim||x|| w(x)[(p(n)(L n (i)(x) — i)—^(x)] = uniformly w.r.t. x G K; 

(ii) for every n G N there exists a linear operator L n : S(K, X) i— ► T(K, X) 
such that for every x G K and u G X' 

L n (u o ip x ) = u o L n (tp x ) (3.2) 

and for k—0,1 

\im\\x\\ k w(x)[(p(n)L n (vl) x )(x) — j3{x)} = uniformly w.r.t. x G K; 

(Hi) for every bounded subset B C £^ 2 ^(X, R) 

linnu(a;)[<p(ra)(L n (u o ip x 2) )){x) - T(v)(x)} = 

uniformly w.r.t. x E K and v G B. 
Then 

(1) for any f G C 2 (K, R) such that 

lim w(x)[ip(n)(L n (u>f(x, •)d 2 ))(x)] = uniformly w.r.t. x E K (3.3) 

one gets 

lim^(n)(L n (/) - /) - A(f)} = (3.4) 

uniformly on K. 

(2) If 

k := sup w(x)(p(n)L n (d x )(x) < +oo, (3.5) 

x<=K,n>l 

and if there exists q G]2, +oo[ such that d x G E for all x G K and 

lim w{x)ip{n)L n {d x ){x) = uniformly w.r.t. x E K, (3.6) 

i/ien for every f G UC 2 (K,R) formula (3.3), and therefore also formula 
(3.4), hold true. 

Proof. (1). Consider / G C%(K,R); by (2.11) we deduce that 

/ = f(x)l + f(x) o i, x + i/»(x) o v£ + w/ (x, •)<£, (1) 

whence we obtain that for every n > 1 
L n (/) = f(x)L n (l) + L n (f(x) o ^) + ^L n (/"(x) o ^) + Ln (tu f (x, -)d x ). 
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Then, taking (3.2) into account, 

|^)b(n)(L n (/)-/)(z)-i(/)(x)]|< 

< \w(x)f(x)[ip(n)(L n (l)(x) - 1) - 7 (x)] | + 

+ | w(x)((p(n)[L n (f'(x)oip x )(x) - (f(x)op)(x)} | + 

+ | w{x)[^L n {f\x)o^ x ){x) - \T{r{x)){x)\ | + 

+ \w(x)ip(n)L n (u f (x,-)d 2 x )(x)\< 

< | w(x)f(x)[<p(n)(L n (l)(x) - 1) - 7 (x)] | + 
+ || /'(x) || ||w(x)(^(n)L n (^)(x) -/3(x))|| + 

+ ^(x)(^(n)L n (r(x) o^)(x) -T(/"(x))(x))| + 

+ | w(x)(p(n)L n (ujf(x,-)dl)(x) | . 



By virtue of hypothesis (i), inequality (2.12), and the fact that ||x|| < 
1 + ||x|| 2 for all x G K one obtains 



limw{x)f(x)[cp(n)(L n (l)(x) - 1) - 7 (x)] = 

uniformly on K. 

Moreover, combining (2.13) and condition (ii) we deduce that 



(2) 



lim||/'(x)|| \\w(x)((p(n)L n (ip x )(x) - (3{x) 







(3) 



uniformly on K. 

On the other hand, as /" is assumed to be bounded on K, the set B(f) : = 
{f"(x) e £ (2) (X,R) : x e K} is a bounded subset of the space C^(X,R), 
and thus condition (iii) applies, implying that 



]imw(x)[<p(n)(L n (f"(x)oij;l)-T(f"(x)))(x)] = 



(4) 



uniformly on K. Therefore, taking (3.3), (2), (3) and (4) into account we ob- 
tain the assertion. 

(2). Consider / e UC$(K, R), and set again M := sup x( zK \\f"( x )\\] we shall 
prove that (3.3) is fulfilled. 

To this aim, set first e > 0; as a consequence of (2.16) there exists 5 > such 
that for every x,y G K, \\x — y\\ < 5, one gets 



\uf(x,y)\ < 



2kn 



(5) 
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furthermore, (3.6) implies the existence of some v G N for which 

| w(xMn)L n (dl)(x) \< 2{M £ +1 f - 2 (6) 

for every x G K and n G N, n > v. 

Now set x,y £ K; if ||x — y|| < <5, then 



whereas, if ||a; — y\\ > 5, 



\uf{x,y)\d 2 x {y) < ^dl(y) 



f>i ,.\ / 1 I .1-1; ..\ ^ n/r^xiy) 



\u f (x,y)\d 2 x (y) < Md 2 x (y) < M- Sq 2 . 

In any case, for all x G K 

p d q 

\u f (x,-)\d 2 x <—dl + M^- 2 . (7) 

Consider n G N, n > u, and x G if; by (3.5), (6) and (7) we deduce that 

\w(x)tp(n)L n (uf(x,-)dl)(x)\ < 

< | w(x) | (p(n)L n (\u f (x, -)\dl)(x) < 

< | w(x) I (^(n)[ 2 ^L n (^)( x ) + — L n (4)(x)] < e. 

D 

In the following Corollary we shall apply Theorem 3.1 to a sequence of 
linear positive operators defined by means of a family (fJ> x ,n)xeK,n>i of Borel 
measures on K such that the linear subspace E introduced in (2.9) also satisfies 
the inclusion 

Ed H £\K,»x, n ), (3.7) 

x<=K,n>l 

where each C l {K, fi x ,n) denotes the space of all /i Xi „-integrable functions on K. 
In the sequel, if x G X and 5 > 0, we set 

B(x, 5) := {y G X : || x - y ||< 5}. 

Corollary 3.2 Let E be a linear subspace of J 7 (X, TV) satisfying (2.9) and 
(3.7), where ([A x ,n)xeK,n>i ^ s a given family of Borel measures on K, and let 
w G JF(X, R). Moreover, consider a divergent sequence of positive integers 
(<p(n)) n >i and the sequence (L n ) n >i of positive linear operators from E into 
TiK, R) defined by 

L n (f)(x) := / fd/ji x , n (feE,xeK,n> 1), (3.8) 

JK 
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and assume that it satisfies condition (3.5) of Theorem 3.1, namely 
k := sup w(x)(p(n)L n (d x )(x) < +oo; 

x<=K,n>l 

moreover, suppose that for any 5 > 

\im(p(n)w(x) / d x dfu, xn = (3.9) 

n JK\B(x,S) 

uniformly w.r.t. x G K. 

Then, for every f G UC 2 (K, R) 

lim w(x)[<p(n)(L n (u)f(x, ■)d 2 x )){x)} = uniformly w.r.t. x G K 

and therefore, if also (i), (ii) and (Hi) of Theorem 3. 1 are fulfilled, the asymp- 
totic formula (3.4) holds true. 

Proof. It suffices to adapt the proof of Theorem 2 of [2] to the present case. 

□ 

Remark 3.3 Note that, if each fi xn is finite (x G K, n > 1), then formula 
(3.2) holds true. Indeed, as \\if) x \\ < 1 + ^ f° r an y x e &■> Ik HV'xIM/^t.n < +oo, 
and thus ip x is Bochner-integrable with respect to each fi t , n - In order to obtain 
(3.2) it suffices to consider the linear operator L n : S(K,X) — ► !F{K,X) 
defined by 

L n (if)(t):= f <pdpt t , n {n>l,<peS(K,X),teK). 
Jk 

D 

In what follows we apply Corollary 3.2 to obtain an asymptotic formula 
for a particular sequence of operators of the form (3.8). 

Consider a Banach space X and a probability Borel measure \x on X such 
that 

| x || 2 dfi(x) < +oo (3.10) 



IX 

and for every u G X' 

u{x)d[x(x) = 0. (3-11) 



x 

2. 



Observe that f x \\x\\d/j,(x) < +oo, as for any x G X, \\x\\ < 1 + ||x|| ; 
moreover for all v G £^ 2 ^(X, R) the mapping x i— > i;(x, x) is /i-integrable, 
because it is continuous and for all rr G X | v(x,x) |<|| v \\ \\x\\ 2 . 
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Consider the space 



E : = {/ G C(X, R) : sup ^ ', „„' < +00}, 



1/0*01 

ce x 1 + ||x| 



which is included in C l (X, fi) by assumption (3.10). 

Moreover, for every f E E, x E X and n > 1 the function 1/ 1— >• /(x + -H£ J 

is /z-integrable too; indeed, set M := sup xgX ^hI; then 

|/(X+-7=)| <M(1+ ||X+-T= f) <M(1+ II X f +^=||a:|| ||y|| + i||y|| 2 ). 

'n \/n \/n n 



Let us now introduce the sequence of positive linear operators from E into 
F(X, R) defined by 

£„(/)(*):= f f(x + ^=)dti(y) (feE,xeX,n>l). (3.12) 
Jx v n 

Corollary 3.4 For every f G UC^(X) 

limn(L n (/)(x) - /(x)) = J / f"(x)(y,y)d f x(y) (3.13) 

uniformly with respect to x E X . 

Proof. We shall apply Corollary 3.2 to the special case in which K — X, 
w :— 1 and <^(n) := n for any n > 1; to this purpose we remark that the n-th 
operator in (3.12) may be regarded as a particular case of the one introduced 
in (3.8) if /i Xj „ denotes the image measure of \i under the continuous mapping 

First observe that for all n > 1 L n (l) = 1, and therefore condition (i) of 
Theorem 3.1 is fulfilled by setting 7 = 0. Consider now u E X' and x G X; 
then, taking (3.11) into account, for every zGlwe have 

f y 

L n (u o if) x )(z) = / u(z - x + —=)d^(y) = u(z - x) = u(i/) x (z)), 
Jx Jn 



and thus (3.2) is satisfied by denoting with L n 's the identity operator on the 
space S(X,X). Therefore, condition (ii) of Theorem 3.1 holds true for such 
L n and (3 := 0. 

Next notice that, as for every v G £( 2 \X, R), n > 1 and x G X 

r 11 11 1 r 

L n(voij;W)(x) = / v(—=, —=)dn{y) = - I v(y,y)dfi(y), 

JX \/Th \/Tl Th J X 
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also condition (iii) of the previously quoted Theorem 3.1 applies if we introduce 
the operator T : & 2 \X, R) -»• F(X, R) defined as 

T(v){x) := / v(y,y)dfi(y) (x G X). 

Jx 

We now show that the hypothesis of the preceding Corollary are satisfied; 
firstly we notice that 

L n (d 2 x )(x) = - \\y\\ 2 dfi(y) 
n Jx 

for any x G X; secondly for every 8 > 0, n > 1 and x G X we have 

n d l(y) dn x ,n(y) = 

Jx\B(x,8) 

= n / l x \B(x,5){x + —i=) d/i(y) = 

Jx \/n n 



ix\B(o,v^)(2/)IMI My) = I , ^ N \\y\\ My), 

where 1x\b(x,s) an d lx\B(o,^/nS) denote respectively the characteristic function 
of X \ B(x, 6) and X \ B(0, ^n8). 

From the above equalities, and by virtue of hypothesis (3.10), we may apply 
Corollary 3.2 to the present case and then (3.13) holds true. 

□ 

Let us turn our attention to the case in which K is a convex subset of R p , 
where p > 1. The space R p will be endowed with an arbitrary norm || • || 
and, in analogy with the notation introduced preliminarily to Theorem 3.1, 
the symbol C 2 (K,R) will stand indifferently for the space 

C (2) {K, R) : = {/ G C(K, R) : / is the restriction to K of a twice 

continuously differentiable function defined on an open 
neighborhood of K} 

if Int(K) = 0, or for the space 

C 2 (K, R) : = {/ G C(K, R) : / is twice continuously differentiable on 
Int(K) and its derivatives admit continuous extensions 
up to K} 

if Int(K) ^ 0. 



424 F.AItomare.S.Diomede 



Moreover, by T{K, W) we mean the space of all functions f from K into R p , 
while S(K, R p ) stands for the linear space spanned by the set {ip x : x G K}. 
Let E be a linear subspace of T{K, R) such that 

{/ G C(K,R) : sup J^L eR}GE, (3.14) 

X <=K 1 + \\X\\ Z 

and denote by pri : R p i— > R the i-th projection on R p , i.e., the mapping 
defined by setting 

pri(x) := Xi for every x = (xj)i<j< p G R p . (3.15) 

Before we state next theorem, we remark that if / G C 2 (K, R) and each of 
its second order partial derivative dx J x . is uniformly continuous and bounded, 
then the second order derivative /" of / is uniformly continuous and bounded 
on K, that is, / G UC 2 (K,H) according to the definition in (2.8). 

Indeed, by the sake of simplicity we assume that R p is endowed with the 
Euclidean norm || • ||. We recall that, if / G C 2 (K,TL), then / possesses 
continuous first and second partial derivatives on a neighborhood of K or, 
respectively, on the interior of K, which are continuously extendable up to K. 
Moreover, if a G K, then f'(a) : R p — » R is the linear form defined by 

p df(a) 

f'( a )(y) = E -jt 1 * (y = (w)i<i<p e RP ). 

while the second derivative of / in a is the bilinear form f"(a) : W x R p — > R 
defined by 

for every y = (^)i<j< p , z = (^)i<j< p G R p . 
Moreover, since 

P d 2 f(a) 

lir(«)IL( 2 )(R,,R)<E ■l^^rl. 

/" is bounded on K provided that each second partial derivative of / is 
bounded uniformly on K; furthermore, as for any a,b G K 

11/ («) - / WIU (RP , R ) < E 1^- - ^^:l. 

if all second partial derivatives are uniformly continuous on K, then /" is also 
uniformly continuous on K. 

We are now ready to present the following result. 
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Theorem 3.5 Let aij,/3i,'y,w : K ^ R be given (i,j = 1, . . . ,p), and con- 
sider a divergent sequence of positive integers (</?(n)) nS N and a sequence (L„) n eN 
of positive linear operators from E into T{K, R) . 
Let 

A:C 2 (K,R)^F(K,R) 

be the operator defined by setting for any f G C 2 (K, R) and x G K 

1 p d 2 f p df 

A(f)(x) := - £ a ij (x)^-(x) + j:f3 i (x)Mx)+ 1 (x)f(x). (3.16) 

Assume that 

(a) for k = 0,2 

\im\\x\\ k w(x)[(p(n)(L n (l)(x) — 1) — ~y(x)] = uniformly on K 

(b) for k — 0, 1 and i = 1, . . . , p 

liiii\\x\\ k w(x)[(p(n)(L n (pr i o ip x )( x ) ~ Pi( x )] — uniformly on K; 

(c) for every i,j = 1, . . . , p 

\im.w(x)[ip(n)L n ((pri o ip x ){prj oijj x )){x) — aiij(x)] = uniformly on K; 

(d) 

sup w(x)(p(n)L n (d 2 )(x) < +oo, 

n>l,xeK 

(e) there exists some q G R ; q > 2, such that d x £ E for all x G K and 

\im w(x)ip(n)L n (d x )(x) = 
uniformly with respect to x G K . 



d 2 f 



:i< 



Then for any f G C 2 (K, R) such that all the partial derivatives 
i < P, 1 < J < p) ire uniformly continuous and bounded, 

limw(x)[(p(n)(L n (f) — /) — A(/)](x) = uniformly on K. 

Furthermore, the same result holds true provided that the operators L n are 
of the form (3. 8) and condition (e) is replaced by the following one 

(f) f or every 5 > 



\imtp(n)w(x) / d, r da nx = 

« JK\B(x,S) ' 



uniformly with respect to x G K . 
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Proof. We shall apply Theorem 3.1, part (2) and Corollary 3.2 to the present 
case. To this aim, consider / G C 2 (K,R) such that 9 1 is uniformly contin- 
uous and bounded for every i,j : 1, . . . ,p. Define for any n G N the operator 

L n : S(K, R p ) h+ T{K, W) 

by setting for every ip G S(K, W) 

L n ((p) : = (L n (pri o tp), L n (pr 2 o </?),... , L„(pr p o </?)). (3.17) 

Now observe that for any linear form u on R p , denoted by (ui, . . . ,u p ) the 
vector in R p representing u, for every x, y G K we have 

(uoip x )(y) = J2 u iP r i(v - x ) = ^Z u ii.P r i i , x){yY, 

1=1 1=1 

therefore for any x E K 

L n (uotp x ) = L n (^2ui(priOil> x )) = ^UiL n (pri oip x ) = uo L n (ip x ). (3.18) 
i=i i=i 

If we set /9(x) := (/9i(x), . . . ,{3 p (x)) for any x G if, we have for k=0,l: 
\imw{x)\\x\\ k { V {n)L n {4) x ){x) - (3{x)) = 

uniformly on K because of hypothesis (b) and of equality (3.18), and therefore 
condition (ii) of Theorem 3.1 is fulfilled. 

Furthermore, in order to verify that (iii) of the statement of Theorem 3.1 is 
satisfied by a suitable operator T, set for any v G C^ 2 '(W, R) and x G K 

v 
T(v)(x) : = Yl v ij a ij( x )i ( 3 - 19 ) 

where (fy)ij=i,..., p is the p x p matrix canonically associated to v. 
In particular from (3.19) we deduce 

nr (*)) = f aSL:(*K- (* e *) 

i,j=l ^ J 

We also point out that, if || • H2 denotes the Euclidean norm on R p , there exists 
h > such that || • || < h\\ ■ ||2- Hence, for every v G £^ 2 ^(R P ,R), denoted by 
(fy)j,j=i,...,p the relevant representing matrix, we have that 

I Vij \< h 2 || v || for every i,j = 1, . . . ,p. 
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Consider a bounded subset B of C^ 2 \W, R) and set sb '■= sup„ eB ||i>||; there- 
fore for every x G K 

\w(x) (v(n)L n (v o ^)( x ) - T(v)(x))\ = 

(p \ p 

J2 Vij(pri o tl) x ){prj O ij} x ) J (x) - J2 V iJ a ij( X ))\ = 

p 
= |w(x)( J2 Vij{ip{n)L n ({pri o ^)(pr j o ^ x ))(x) - «ij(x))| < 

p 
< \w{x)\h 2 s B J2 \(<p(n)L n ((pri°ipx)(prj o tp x ))(x) - Oij(x))\ 

and so, by hypothesis (c), 

\imw(x)[(p(n)L n (v o ip^){x) — T(y)(x)] = 

uniformly on K and with respect to v e B. 
Therefore, Theorem 3.1, part (2) applies, and thus 

\imw(x)[ V (n)(L n (f) - f) - A(f)](x) = 

uniformly on K. 

Finally, if each operator L n is of the form prescribed in (3.8) and condition 
(f) replaces (e), by Corollary 3.2 the assertion readily follows. 

□ 

Remarks 3.6 1). In the particular case in which K is a (possibly unbounded) 
real interval, the same result described above was already obtained in [2], 
Theorem 1. 

2). Theorem 3.5 includes, as a particular case, Theorem 4.1 of [9]. 

4 Applications 

In this Section we make use of the results of Section 3 to obtain an asymptotic 
formula for Bernstein-Schnabl operators defined on a bounded convex subset 
of a separable Banach space as well as for the multidimensional extension of 
the so-called Szasz-Mirakjan operators. 
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Let X be a separable Banach space, and consider a convex and bounded 

subset K of X. Moreover, let (fi x )xex be a family of probability Borel measures 

on K such that 

/ u\K<ifi x = u(x) for any u G X' . (4.1) 

Jk 

Set E := Cb(K, R) the space of all bounded continuous functions on K. The 
space E verifies (2.9) because K is bounded. 

For every n > 1 consider the n-th Bernstein-Schnabl operator defined by 

B n (f)(x):= ( ■■■ ( f( Xl + '" + Xn )dfi x (x 1 )---dfi x (x n ) (4.2) 

Jk Jk V n J 

(/ G E, x G If) (see [3], Chapter 6, for several additional properties of these 
operators) . 

Notice that if we consider the mappings ir n : X n — > X defined by 

7r„(:r !,•••,£„) := — (x ± , ■ ■ ■ , x n G X) (4.3) 

n 

and if we denote by 

Hx,n ■= Tf n (fi x <S> ■■■ <S> (J>x) (x e X,n>l) (4.4) 

the image measure of the n-times tensorial product /j, x <g> • • • <S> H> x under 7r n , 
the operators B n may be put in the form (3.8) by writing 

B n (f)(x) = f fdfji x , n (feE,n>l,xe K). (4.5) 

J K 

Moreover, by virtue of a theorem of Kolmogorov's (see [7], Corollary 9.5), 
for any x G K there exists a sequence (X njX ) n >i of independent random vari- 
ables defined on a suitable probability space (Q, J-, P) and with values in K 
such that fi x is the distribution of X HyX , for all n > 1. 

This argument allows us to write the operators B n also as 

B n (f)(x) = J n f(^f2X iiX y \dP. (fEE,n>l,xeK) (4.6) 

Finally we remark that for every v G C^ (X, R) and x G K 

\v(y,y)\d/j, x (y) < +oo, 

K 

because of the boundedness of K. 

We are now ready to state the following result 
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Theorem 4.1 Let X be a separable Banach space, K a bounded and convex 
subset of X and (fi x )xeK a family of probability Borel measures on K satisfy- 
ing (4-1)- Consider the sequence (B n ) n >i of the Bernstein- Schnabl operators 
defined by (4-2), and a bounded function w G J-"(K, R). If 

sup | w(x) | nB n (d x )(x) < +00, (4.7) 

n>l,xeK 

then for every f G UC%(K, R) 



lim w(x) 

n— >oo 



n(B n (f)(x) - f(x)) - - (j f"(x)(y,y)dfi x (y) - f"(x)(x,x] 



uniformly with respect to x G K . 

Proof. We shall apply Theorem 3.1 and Corollary 3.2 to the present case, 
where E := Ct,(K, R) and <p(n) = n, (n > 1). Preliminarily observe that 

B n (l) = 1, (n > 1) 

and, for any u £ X' and x G K, 

B n (u o ip x ) = u o ip x . (n > 1) 

Moreover, for every v G C^ (X, R) and x G K 

B n (vo^)( x ) = 

/ / X\ T" ' ' ' T" X n X\ "T ' ' ' T" X n \ , . , . 

/ v x, x d^ x [x x ) ■ ■ ■ dfj, x {x n ) = 

k Jk \ n n / 

/ X\ t" • • • "T 3; n 3/1 t ' ' ' t j;^ \ . . 

w , d\x x {x\) ■ ■ ■ d\x x [x n ) + 

k Jk V n n / 

2/ ••• / t>(— -,x) d/ji x (xi) ■ ■ ■ dfj, x (x n ) +v(x,x) = 

Jk Jk V n / 

1 n r r 

X) / / v{xi,Xj)d^ x {x i )dn x (x j ) -v(x,x) = 



n 2 ij^iJi<-"< 



— in J v{y,y)dfi x (y) + n(n - l)v(x,x)j -v(x,x) 



rv 

- (j K v(y,y)dfi x (y) -v(x,x] 



Therefore, if we set 7 = 0, j3 = 0, -B„ = I for every n G N (where 
I is the identity operator on S(K, X)) and if we consider the operator T : 
£^(X, R) -> ^(X, R) defined as 

T(v)(x): = / v(y,y)dfx x (y)-v(x,x), (v E C {2) (X,R),x e K) 

Jk 
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conditions (i), (ii) and (iii) of Theorem 3.1 are all satisfied. 

In addition, set D := sup^g^ ||x||; we shall prove that formula (3.9) is 
verified. Consider 5 > 0, x G K and n G N, and take (4.5) and (4.6) into 
account; then 



/ d 2 x dll x , n < AD 2 lK\B(x,8)dHx, n = (1) 

JK\B(x,S) Jk x ( ' 

= 4L> 2 J a lK\BM o (^X^dP = 

1 n 
= AD 2 P{\\-J2X x ,-x\\>5}. 

By the strong law of large numbers there is some v G N such that for any 
n G N, n > v 

1 n 8 

E{\\-Y.X x , t -x\\)<-. 
n i=1 z 

therefore, following the proof of a Lemma by Beska and Dziedziul (in [8], pg. 
715), for any such n we obtain 

1 n 1 n A 1 n 

p{|| -£**,*-* H>*} <p{\\-J2x x ,-x\\>°- + e\\-Y^x x ,-x\\} 

™ n n5 

< P{\\ J2X x ,-nx || -E(\\ J2X x , t -nx ||) > — }. 

8=1 1 = 1 

By applying an inequality established in ([11], formula (2.19), page 184) to 
the independent sequence of bounded random variables on K 

X n := X x ^ n -x, (ne N) 

we deduce that 

n n r 

P{\\J2X Xii -nx || -£(|| £X X)< -nx ||) > — } < 

«=i i=i ^ 

n 8 n 8 n5 

~ 2e ^(- 32E r=i l|X n p } - 2eXp{ -^AD^ = 2eXp{ -12SD^ 
combining this last inequality with (1) we obtain that for any 8 > and n > v 

n5 2 s 



/ d 2 du x n < 8D 2 exp(- 

JK\B(x,5) 



Ik\b(x,8) x ^' n - ^ v 128D 2/ 

uniformly with respect to a; 6 if, so that condition (3.9) of Corollary 3.2 is 
also fulfilled. The assertion follows straightforwardly from this last mentioned 
corollary. 
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□ 

Remarks 4.2 1) If (X, <•,•>) is a Hilbert space, condition (4.7) holds true 
with w — 1; indeed, for any n > 1 and x E K 

B n (d 2 x )(x) = 

• • / < x, x > dii x (xi) ■ ■ ■ dn x (x n ) 

k jk n n 



- ( / \\y\\ 2 dvx(y) - IMI 2 ) • 



n 

2) Theorem 4.1 was also obtained by Beska and Dziedziul in [8], Theorem 
4.2, in the special case in which K is a simplex and w — 1. 

If K is a convex and compact subset of some Euclidean space, a similar 
result was previously proved in [1], Proposition 2.1 (see also [3], Theorem 
6.2.5). 

As a second application of the results of the previous section we examine 
an asymptotic formula for the sequence of the p-dimensional Szasz-Mirakjan 
operators, p > 1, which has been introduced by a more general scheme by 
Nishishirao in [10], Section 4. 

Consider the space R p endowed with an arbitrary norm || • || and set K :— 
{x = (xi)i<i< p G R p | Xi > for every % — 1, . . . ,p} and 

E:={f EC(K,R) : \f(x)\ < M\\x\\ m for any x e K 

and for some M > and m G N}. 

Next we introduce the n-th p-dimensional Szasz-Mirakjan operator, defined 
for any / G E and (x±, . . . , x p ) G K as 



A.,(/)(xi,...,x 1 ,):= e -5Xi*'5:...i;/ ^,...,? n 



f\/\ rv>ff \ -■ — r \liX^ 



fci=0 k„=0 \ n n J i=l hi- 



(4.3) 



The positive operators A UtP are well defined on E. Assume, indeed, that a 
function / G C(K, R) satisfies a growth condition of the form |/(x)| < M||a;|| m 
(x G K) for some m G N and M > 0. If m = 0, / is bounded and clearly the 
series in (4.3) are convergent; if m > 1, since the norm || • || is equivalent to 
the Z m -norm 

/ p \ V m 
||(xi,...,x p )|| m := (S^r) ((x 1 ,...,x p ) G X), 
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there exists a further constant R > such that 

\f(x)\<R\\x\\™ (xeK). 

Therefore for every x = (xi, . . . ,x p ) G K the series in (4.3) is convergent 
as a linear combination of the sum of the series 

SO" 6 ? "- 1 rt 

which are convergent by the ratio test. 

Among other things, the operator A n ^ p may itself be put in the form (3.8); 
indeed, consider the following probability measure on R: 

^:=£e-^4^£, (y>0,n>l), 

I n III " 

where €h denotes the unit mass at - and set, for any x := (x±, . . . , x p ) G if 

n 

v 

T n ,x ■= ®7T n , Xi ; (n > 1) 

i=l 

then we may write 

^n, P (/)(x) = / fdr n , x . (feE,xeK,n>l) 

Jk 

Let us now state the pre-announced asymptotic formula regarding the se- 
quence (Ai, p )„>i. 



Theorem 4.3 For any f G C 2 (K,R) such that every partial derivative 9x .g x . 
(1 < i, j < p) is uniformly continuous and bounded, 



lim 



n 1 + \\x\ 



1 p d 2 f 
n(A ntP (f)-f)(x)- ^Y, x i-^jl( x ) 

z i=\ ox i 



(4.4) 



uniformly with respect to x = (x±, . . . , x p ) G K . 
In particular, for every compact subset Q of K 



limn(A n , p (/) -/)(*) = -£ 



1 * d 2 f 



&i ^ rt [X j 



uniformly with respect to x = (xj)i<j< p G Q. 
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Before we prove (4.4), we remark that it has already been established in the 
one-dimensional case for two different classes of operators which both include 
the Szasz-Mirakjan ones (see Theorem 2.3 in [4] and Proposition 3.1 in [5]). 

Let us now proceed to verify that conditions (a)-(e) of Theorem 3.5 hold 
true. 

To this purpose set 7 = 0, /% = 0, a^ = pr\, and a^j — if i ^ j, for 
every i, j = 1, . . . ,p, and moreover set w(x) := 1+ i| a .|i2 ( x *= K) an d V 9 ( n ) := n 
for any n G N. 

Notice that, with this coefficients, the differential operator (3.16) prescribed 
in the statement of Theorem 3.5 has the form 

1 p d 2 f 
A(j)(xi, . . . ,x p ) = - 2 ,Xj—^{xi, . . . ,Xp) 

for all / G C 2 (K, R) and (xi, ...,x p )eK. 

For any n > 1, i, j — 1, . . . ,p and x — (x±, . . . , x p ) G K, we get 

^■n,p(l) = 1) 
Ai,p(P r i) = VTi-, 

A n ,p(P r i) = P r i + -pn, 
and hence 

A n , p (priOip x )(x) = 0, 
Ai, P ((P r i ° ^x)(prj o ip x )){x) = if i^j, 

' n 

so that (a), (b) and (c) are satisfied; moreover, as 



1 p 
An, P {d x ){x) = —y^Xj, 



n i=i 



we readily deduce that 



nA nyP (d 2 x )(x) 



sup - < +00, 

x&K,n>l 1 + \\X\\ Z 



and thus also formula (d) holds true. 

Next fix x = (xi, . . . ,x p ) G K and observe that for any y G K, y 

/ p \ 2 v 

d t(y) = J2( x i - Vif ^ p 2 J2( x i - Vi) 4 i 



\i=l / i=l 
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that is 



p 

J X ) 1 

1=1 



and thus for any n > 1 

v 

A n:P (dl)(x) <p 2 J2 A n . p ((pri o ip x ) 4 )(x). 

i=\ 

On the other hand, for every % — 1, . . . ,p it can be proved that 
A n , p ((pr, o ^ x )%x) = £ e—(- - xA A -^f = \x\ + % 

fc=0 

(this equality is fully detailed in [12], Proposition 4; otherwise one may see it, 
in a more general case, in the proof of Proposition 3.1 of [5]). 

Summing up we obtain for every x — (xi, . . . , x p ) € K and n > 1 

i=l U '«' 



1+lbP 



which implies that 

lim — — "'''"1!, y = uniformly on K 

" 1 + lkll 2 

and thus, finally, also condition (e) of Theorem 3.5 is satisfied with q=4. 
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Abstract 

The generalised weighted trapezoidal rule is investigated which in- 
volves /'"' (t) of bounded variation. If the weight is taken to be identically 
unity, then previous results are recaptured. A comparison with weighted 
Ostrowski results is made and it is demonstrated that if the weight is 
symmetric about the midpoint over the interval [a, x) and (a;, b] then the 
bounds are the same. In particular, if the weight is unity, then the gener- 
alised trapezoidal and Ostrowski results produce the same bounds. 

Keywords: Weighted Rules, Generalised Trapezoidal Bounds, Ostrowski. 
AMS Subject Classification Codes: Primary 26D15; Secondary 65D30. 

1 Introduction 

K.S.K. Iyengar [11], by means of geometrical consideration, has proved the 
following theorem. 

Theorem 1 Let f be a differentiable function on [a, b] and \f'(x)\ < M. Then 

(1) 



f(x)dx- l -(b-a)(f(a) + f(b)) 



(See also [18, p. 471 - 474] for related results). Further generalisations were 
also given by Agarwal and Dragomir [13], and Cerone and Dragomir [15]. 
In [19], the following generalisation of Theorem 1 is proved analytically. 
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Theorem 2 Let f (x) be a differentiable function defined on [a, b] and \f (x)\ < 
M for every x G (a, b) . Ifp(x) is an integrable function on (a, b) such that 

0<c<p(x)<Xc (A> 1, x€ [a, 6]), 

then the following inequality holds 



A(f;p)- 1 -(f(a) + f(b)) 



< 



M (b - a) (X + q)(l-q 2 )+2(\~l)q 
2 ' 2A(l + a)-(A-l)(l + o 2 )' 



(2) 



where A and q are defined by 



J aP (x)f(x)dx |/(6) -/(o)| 

A(f;p)= ^-— ~ and q~ 



J a p(x)dx 



M{b-a) 



Cerone and Dragomir [6] also proved the following weighted trapezoidal re- 
sult. 

Theorem 3 Let f : I C R —>■ M be a differentiable mapping on 1 (the interior 
of I) and [a, b] <zl with M = sup /' (x) < oo, m = inf /' (x) > — oo be the 

x£[a,b] xe[a.b] 

first moment of w (•) on [a, b] . Lf f is integrable on [a, b] , then the following 
inequality 



w(x)f(x)dx--[f(a) + f(b)]-m 



[b-a 



v /, n , „ n M — m ,, . 
< - (b - a) (S - m) < ~^—v(b - a) , 



(3) 



where S is the slope of the secant on [a, b] . 

The well-known Ostrowski inequality is given by the following theorem [19], 

Theorem 4 Let f be a differentiable function on [a,b] and let \f (x)\ < M on 
[a, b] . Then, for every x € [a, b] , 



fix) 



f(t)dt 



< 



(b-af 



(b - a) M. 



(4) 



A weighted version of the Ostrowski inequality (4) for Holder mappings has 
been given in [8] by Dragomir et al. Other results related to the Ostrowski 
inequality may be viewed in [2] , [3] , [20] , [23] and a book devoted to Ostrowski 
type results edited by Dragomir and Rassias [9]. A weighted multidimensional 
generalisation of Ostrowski 's inequality was treated by Milovanovic [14]. 

In a recent thorough article Matic, Pecaric and Ujevic [13] obtained weighted 
n— time differentiable Ostrowski type results in terms of a variety of norms. 
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The norms, including when /(") is of bounded variation, Holder continuous and 
diffcrcntiablc allowing for /(™ +1 ) g L p [a, b] in terms of the Lebesgue norms ||-|| , 

P> 1. 

It is the intention of the current paper to examine bounds for the generalised 
weighted trapezoidal functional. Bounds will be provided assuming f^ n > (•) to 
be of bounded variation, absolutely continuous, Lipschitzian and monotonic. 
Placing restriction on the weight function provides more explicit coarser bounds. 
This is accomplished through the use of a result due to Karamata [12]. Qi [21] 
and [22] examines weighted trapezoidal bounds using a Taylor series argument. 

Following the presentation of some notation, identities are obtained for our 
functional of interest in Section 2. Various bounds arc developed in Section 3 
while in Section 4, the relationship between the trapezoidal and corresponding 
Ostrowski functionals is investigated. It is demonstrated that the bounds are the 
same if the weight function is symmetric over the mid-points of the respective 
intervals [a,x) and (x, b]. 

2 Some Notation and an Identity 

Before proceeding to develop an identity, it is worthwhile to introduce some 
notation. 

Let w (•) be a weight function and suppose that w : [a, b] — ► (0, oo) is inte- 
grablc on the interval [a, b] and such that 

< / w (t) dt < oo. 



Also, let 

mfe (c, d;w) — u w (u) du (5) 

represent the k moment about the origin of the weight function w (•) over the 
interval [c,d] C [a, b]. Further, let 



< M n (a, x; w) = — t I (u — a)' 1 w (u) du 



1^2 [ fc ) ( _a ) n k m k (a,x;w) 



fc=0 



(6) 



1 ' 6 



and 

< M n (x, b; us) = — I (b — u)' 1 w (u) du 

n - J X 

= ^E(^)&"- fe (-l) fe m fc (a ; ,6; W ) 
' fe=0 ^ ' 
It may be observed that for x € [a, b] 



(7) 



rb 
Mq (a, b; w) — Mq (a, x; w) + Mq (x, b;w) — I w (t) dt — toq (a, b; w) 
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and 



(x-a) n+1 (b-x) n+1 

Mn(a,x;l)= [ {n+ [ )[ , M n { X Ml)= \ n+ \ )[ ■ (8) 



We introduce the kernel 



1 / /. in-i 



(t — u) w (u) du, n £ N, 



Q n (x,t;«;):=| (n 1)! ^ - ' i,tg[o,i] (9) 

[ w (£) , n = 0, 

which satisfies 

9 ^ = Q n _ 1 , neN. (10) 

The kernel may further be written, using (6) and (7), as 

{(-l) n M n _i(t,i;w), a<t<x, 
neN (11) 

M„_i (x, i; w) , x < t < b, 

and Qo (x, t;w) — w (t) . 

Further, define the functional 



r b 
T n (a,x,b;f;w)= I w(t)f{t)dt 



J2 [M k (a, x; w) /<*> (a) + (-l) fc M k (x, b; w) /« (b) 



(12) 



fe=0 



for / : [a,b] — ► R, x G [a, 6] and w(-) is a weight function with Mk (-,-',w) as 
defined by (6) and (7). The following theorem holds. 

Theorem 5 Let f : [a, b] — ► R wzi/i a < b. For n = 0, 1, 2, . . . Zet Q n +i (x, i; w) 

6e as gwen &y fP). Further, suppose that for some neNU {0}, f( n > (t) exists 

for t e [a,b], where f(°> (t) = f (t) then for f( n > (•) of bounded variation the 

identity 

r b 

v i+1 / n ... , t-r f M uf(») 



T„ (a, x, 6; /; u;) = (-1)" +1 / Q n+1 (x, t; w) df^ (*) (13) 

/io/ds where T and Q n +i are as defined by (10) and (9) respectively. 

Proof. Before proceeding with the proof it is worthwhile to firstly note that 
the solution to the recurrence relation 

u n = a n -u n -i for n=l,2, ... (14) 

is given explicitly as 



= J2(-lY l ~ k *k + (-l) n u a . (15) 



i—k 

fe=i 
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Now, let 

i-b 

In) 



In= / Qn+l(x,t;w)df n >(t) (16) 

J a 

then integration by parts of the Ricmann-Sticltjcs integral gives 

In = Qn + 1 (X, t- W) /W (t)]" a + J" ^i/ (n) (t) dt (17) 

and so 

I„ = A n (a,x,b;w;f) - I„-i, n=l,2, ..., (18) 

where, to obtain (18) from (17), we have used 

A n (a, x, b; w; /) = Q n+1 (x, 6; u,) /<»> (6) - Q n+1 (x, a; w) /W (a) (19) 

and (10) and the fact that for /("- 1 ) (£) diffcrcntiablc, df (n - 1 '> (t) = /(") (i) dt. 
To obtain Jo we may either use integration by parts from (16) or equivalcntly 
extend the validity of (18) to n = 0, producing 



where from (16) 
and so from (9) 
That is, 



I = A (a,x,b;w;f) - J_] 



,6 

/-i= / Qo{x,t;w)f^{t)dt 



/-i = / w (t) I (t) dt. 



I = A (a,x,b;wJ)- I w(t)f(t)dt. (20) 



The solution of (18) on comparison with (14) and (15) upon using (20) is given 

by 

In = (-1)" +1 / w (t) f (t) dt + JT {-l) n - k A k {a, x, b; w; f) . 
Ja fe=o 

Thus, 

/b n 

w (t) f (t) dt-Y, (-1)" M (a, x, b; w; f) (21) 

fe=0 

where from (19) and (11) 

A k (a, x, b; w; /) = M k (x, b; w) /« (b) - (-l) k+1 M k (a, x; w) /W (a) 
and so from (21) the identity (13) is procured. I 
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Remark 1 If f( n ' (i) is absolutely continuous on [a,b], then it is differentiable 
and df( n 1 (t) = /( n+1 ) (t) dt giving from (13) the identity 



T n (a,x,b;f;w) = (-l) n+1 / Q n+1 (x, t; w) / ( " +1) (t) dt 



where T n and Q n are given by (12) and (9) or (11) respectively. 
Remark 2 It w(i) = 1 then from (9) 



(x,t;l) 



{t-xf 



nl 



and from (6) and (7) 



i \k+i /, \fc+i 

M k (a,x;l) = ,, and M k (x,b;l) = ^—— \, . 

ky ' ' ; (fc+1)! kK ' ' ' (£;+l)! 



(22) 



r-b n 

T n (a,x,b;f;l) = / / (t) dt - J^ 



Further, from (12) and (22) 

n (x - a) k+l /W (a) + (-l) fe (b - x) k+1 /« (b) 

fe=0 

is the identity obtained in Cerone et al. [7], giving the non-weighted n—time 
differentiable generalised trapezoidal identity. 



3 Inequalities for the Generalised Weighted Trape- 
zoidal Rule 

The following well known lemmas will prove useful for procuring bounds for a 
Ricmann-Sticltjcs integral. They will be stated here for lucidity. 

Lemma 1 Let g, v : [a, b] — ► M be such that g is continuous and v is of bounded 
variation on [a,b\. Then the Riemann-Stieltjes integral J g (t) dv (t) exists and 
is such that 



9 (t) dv (t) 



< sup \g(t)\\/(v), 
te[o,6] 



(23) 



where \/ (v) is the total variation of v on [a, b]. 



Lemma 2 Let g, v : [a, b] —>■ R be such that g is Riemann integrable on [a, b] 
and v is L—Lipschitzian on [a,b]. Then 



g (t) dv (t) 



<L / \g(t)\dt 



(24) 
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with v is L— Lipschitzian if it satisfies 

\v(x) -v(y)\< L\x-y\ 

for all i,y£ [a, b] . 

Lemma 3 Let g, v : [a, b] — > R be such that g is Riemann integrable on [a, b] 
and v is monotonic nondecreasing on [a, b] . Then 



9 (t) dv (t) 



< 



\g(t)\dv(t). 



(25) 



It should be noted that if v is nonincrcasing, then — v is nondecreasing. 

Theorem 6 Let the conditions of Theorem 5 continue to hold so that f( n ' (t) is 
of bounded variation for t e [a, b] . Then we have for all x € [a, b] and n G NU{0} 

\T n (a,x,b;f;w)\ 

( [M n (a,x;w) + M n (x,b;w) 

+ \M n (a, x; w) - M n (x, b; w)\] X \ \j" a (/W) , 

[M n+ i (a, x; w) + Af„ +1 (x, b; w)] L, f( n ' L — Lipschitzian 

M n (a,x;w)[fW(x)-fW(a)] 

+M n (x, b; w) [/(") (b) - /(") (x)] , /(") monoton. nondecr. 

(26) 
where T n (a, x, b; /; w) is given by (12) and M n (a, x; w), M n (x, b; w) by (6) and 

(V- 

Here, by \J a (h) we signify the total variation of h (t) for t G [a, b] . That is 

\/ b aW = f b a \dh(t)\. 

Proof. Taking the modulus of identity (13) and using Lemma 1, we have 



\T n (a,x,b;f;w)\ 



Q n+1 (x,t;w)df^(t) 



< sup \Q n+1 (x,t;w)\\J (fW 

te[a,b] I v 



(27) 



Now, from (9) 



sup \Q n+ i (x,t;w 

te[a,b] 

1 



max ■ 



(u — a) w (u) du, / (b — u) w (u) du 



(28) 



max{M„ (a,x; w) ,M n (x, b; w)} 
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and so using the fact that max{X,Y} = \ [X + Y + \X — Y\] gives the first 
inequality in (26) upon utilising (27). 

For /(") (•) L— Lipschitzian on [a, b], then from Lemma 2 and (27) 



\T n (a,x,b;f;w)\ = 



%+1 (x,t;w)df^(t) 



(29) 



< L / \Q n+ i (x,t;w)\dt. 

J a 

Using the definition (9), we may notice that 

' (-l) Il+1 
Qn+i(x,t;w) = < 



1 

I n 



n\ j t 

i 



(u — t) w (u) du, t G [a, x] 



(30) 



! (t — u) w (u) du, 



t e (x, b] 



and so 



n\ / \Q n +i (x,t;w)\dt 



b t 



(u — t) n w (u) dudt + I / (t — u) n w (u) dudt. 



(31) 



We may simplify the expression on the right by an interchange of the order of 
integration to give 



(u — t) n w (u) dudt = w (u) / (u — t) n dtdu 



(32) 



q-j- / (u — a) n w (u) du 



and in a similar fashion 

6 t 

(t — u) n w (u) dudt 
Hence, from (31), 

rb 



n+ 1 



{b — u) w (u) du 



\Q„+i (x,t;w)\dt = M n+ i (a,x;w) + M n+X (x,b;w) 
giving the second inequality in (26) upon utilising (29). 



(33) 
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For the final inequality in (26), when /(") (t) is monotonic nondecreasing on 
[a, b] , we use Lemma 3 and thus, from identity (13) 



\T n (a,x,b;f;w)\ = 



) n+1 (x,t;w)(tfM(t) 



< / |Qn+i(*,t;«;)|# (n - 1) (t). 



Two cases need to be treated. For n = then 



< 



\T Q (a,x,b;f;w)\< / \Q 1 (x,t;w)\df (t) , 

a 

w (u) du, t <= [a, x] , 
w (u) du, t € (x, b\. 



b J t 
w (u) du\ df(t)+ / [ [w (u) du I df (t) 



where, from (9), 


f r% 


\Qi (x,t;w)\ = < 


I, 
f 




< J X 


Thus, 




\T (a,x,b;f;w)\ 





X \ X 



w (u) du f (t) 



+ I / w(u)du\ f(t) 

-M (a,x;w)f(a) 



w (t) f (t) dt 



- w(t) f (t) dt 

t=x x 



(34) 



(35) 



+ / w(t) f (t) dt + M (x, b; w) f (b) - w (t) f (t) dt 
< M (a, x; w)[f (x) - f (a)} + M (x, b; w) [f (b) - f (x)} . 

Here we have used the fact that if g (t) > and / (t) monotonic nondecreasing 
for t€ [a, b], then 

b b 

(t) f (t) dt < f(b) g (t) dt and 

a b (36) 



g(t)f(t)dt <-/(o) g(t)dt. 
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For n GN then 



n! / \Q n+1 (x,t;w)\df^(t) 

(u - t) n w («) du I df {n) (t) 



(37) 



X W X 



(t - u) n w (u) du\ d/(") (t) 



:— A n + £>„ 
Integration by parts produces on using the Leibnitz rule, 

.1,, - i / (u-t) n w(u)du\ /(")(«) 

( f X (u - tf- 1 w («) du") /<") (t) dt 

cc /a; 

= -n\M n (a, x; w) / (n) (a)+n f f (u - t)^ 1 w («) du ) /W (t) d£ 

a \ t 

< n\M n (a, x; to) \f {n) (x) - /W (a) 
and 

B„ = | / (/ -»)"<r(«)du J /<">(*) 

(t - w)" _1 w («) du] /(") (t) dt 

6 / t 

= n\M n (x, b; w) / (n) (b) - n f I f (t - u) n ~ l w (u) du | /< n > (i) di 

<n!M n (a:,6;«»)f/W(6)-/( n )(x)" 



X \J X 



where we have used the monotonicity of /(") (•) via (36) together with (31) and 
(33) to obtain the upper bounds. 

Substituting A n and B n into (37) and (34) and further recognising that it 
subsumes the result for n — as given by (35), then the last inequality in (26) 
results. I 

Remark 3 For the monotonic nondecreasing result in (26) a tighter bound 
could have been obtained if the result (36) were not used. This, however, would 
have produced a more cumbersome bound. The n = case may have been ac- 
commodated in the general n case since, as given in (9), Qo (x, t; w) = w (t) and 
since w (t) > then \Qq (x, t;w)\ — w (t). 
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The following theorem gives bounds on \T n (a, x,b; /; w)\ in terms of the 
Lebesgue norms of /(™ +1 ) (£). 

Theorem 7 Let the general conditions of Theorem 5 hold and further, let 
f( n > (t) be absolutely continuous for t <G [a, b], then 

\T n (a,x,b;f;w)\ 

( [M n+1 (a,x;w) + M n+1 (x,b;w)}\\f( n + 1 1tt, /<" +1 ) e L x [a, b] ; 



< < 



IIQn(av;«OIIJ/ (n+1) || p , 

[M n (a, x; w) + M n (x, b; w) 



/ ( " +1) eipML 



||f(™+ 1 )|| 

+ \M n (a,x;w)-M n (x,b;w)\] nJ g Nl , /<" +1 ) e Li [a, b] , 

(38) 
where M n (a, x; tu) onrf M„ (x, 6; w) are as given by (6) and (7), Q n (a, x, b; w) 
is defined in (9) and T n (a, x, b; w) by (12). 

Further, ||-|| signify the usual Lebesgue norms where 

Halloo := ess SU P IM*)I f or he L oo[a,b], 

te[a,b] 

and 



\\h\\ P -= / \h(t)\ p dt\ for heL p [a,b], 1 < p < oo. 



Proof. From the identity (22) we have on using properties of the modulus 
and integral, 



\T n (a,x,b;w)\< / Q n+1 (x, t; w) /(" +1 > (i) 
Now, for /0+ 1 ) e Loo [a, &] 



dt. 



(39) 



Q„ +1 (x,t;u;)/(™ +1 )(t) 



dt < 



(n+l) 



\Q n+ i (x,t;w)\dt, 



which upon using (33) produces the first inequality in (38). 

For the second bound we use Holder's integral inequality in (39) to give 



Q n+ i(x,t;w)f( n+ V(t) 

b 



(II 



< 



\Qn+i (x,t;w)\ q dt 



f (n+l> (t) 



(11 



\\Qn(x,-;w)\\ 



(n+l) 
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p 1 
The final inequality in (38) is obtained from (39) to give 



where p > 1, - + - = 1 

P Q 



Q n+1 (x,t;w)f( n+ V(t) dt< sup \Q n+1 (x,t;w)\ / f {n+1) {t) 

te[a,b] 



(It. 



where we may use (28) and a property of the max {X, Y} to obtain the stated 
result. ■ 

Remark 4 // we take the weight function w (t) = 1 then the results of Cerone 
et al. [7] involving the generalised trapezoidal rule and n—time differentiable 
functions is recaptured. 

A question that needs to be asked is can we choose the parameter x in such 
a way that the bound is minimized? The following lemma examines such an 
issue. 

Lemma 4 Let 



24> n (a,x,b;w) 

= [M n (a, x; w) + M n (x, b; w) + \M n (a, x; w) — M n (x, b; w) 



(40) 



and 

*n+i (a, X, b; w) = M n+1 (a, x; w) + M n+1 (x, b; w) . (41) 

Then 

<P n {a, x, b; w) = mm 6 n (a, x, b; w) = , (42) 

x£[a,b] 2 

where x is the solution of M n (a, x; w) — M n (x, b; w). 
Further, 



*n+i ( a '^S~ ' b '> w ) = min *n+i (a, X, b; w) 

2 I xe[a,b] 



a + b \ ( a + b 

M n+ i [ a, — — ; w I + M n+ i I ——,b; w 



, VJJ I , 



(43) 



where w (•) is a positive weight function and x G [a, b] , with M n (a,x;w) and 
M n (x,b;w) are as defined in (6) and (7). 

Proof. The functions M n (a,x;w) and M n (x,b;w) are both positive with 
M n {a 1 x;w) increasing and M n (x,b;w) decreasing in x G [a, b\. Thus, the 
minimum is attained when \M n (a, x; w) — M n (x, b;w)\ = giving the result as 
stated. 

Now, ^ n +i (a, x; w) > and 

*„+! (a, a; w) = * n+ i (6, b; w) = M n (a, b; w) . 
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Also, for w (x) > 



*™+i ( a > x > b ' w ) = i( x - a T - ( b - X Y 



w (x) 



and so ^' n+ i (a, a, b; w) < 0, ^' n+ \ (a, b, b;w) > bringing us to the conclusion 
that ^E' n (a,x, b;w) is convex in x. Since w (x) > 0, the minimum is attained 
when a; = ^, making ^' n+1 (a, ^, 6; w) = 0. ■ 

The following lemma obtains some coarser bounds which may prove to be 
more useful in practice. It involves obtaining bounds on 

*„+! (a,x,b;w) = \\Q n +i (x,-;w)\\i = M n+1 (a,x;w) + M n+1 {x,b;w) . (44) 

Lemma 5 Let w (t) be a weight function defined on [a, b] and x € [a, b], then 

|*„+i (a,x,b;w)\ = \\Q n +i (x,-;w)||i 

( CWIML' we Loo [a, 6]; 



< < 



C 2 (?) IML. weLp[a,&], 



(45) 



,n+l 



p>M + £ = i; 



where 



and 



C(q) = 



{x-a)^ n+1)+1 + {b-xY {n+1)+l 



q(n +1) + 1 



b — a 



a + b 



Proof. From the definitions (6) and (7) it may be noticed that ^ n +i (a, x, b; w) 
from (44) may be expressed as 



where 



Now, 



*„+! (a,x,b;w) = \\Q n +i (a,x, 6;w)lli 

1 f b 

= 7 7^7 / K n+ (a, x, b; u) w (u) du, 

u — a, u € [a, a;] , 



k (a, x, b; u) = 



b — u, u € (x,b]. 



(46) 



(47) 



(n 4- 1)! I^n+i {a>, x , b; w)\ < / \k (a,x,b;u)w(u)\du 

j . 

and so for w € L p [a, b], 1 < p < 00 then 

rb 

K n+ (a,x,b;u) w (u) du 

< [ / n q{n+1) (a, x, b; u) du ) ( / w p (u) du 



(48) 
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Explicitly, 



n q{n+1) (a, x, b; u) du 



{u-a) q{n+1) du+ I (b-u) 



\?("+i) 



dv 



(x-af n+1)+1 + (b-xf n+1)+1 
q{n + 1) + 1 



which together with (48) gives the second inequality (45). 
For w G Lqo [a, b] , then 



n n+l (a, x, b; u) w («) \du < C (1) ||iu|| c 



the first inequality in (45). 

Finally for w G L\ [a, b] , then 



\n n+ (a,x,b;u) w (u)\du < 



sup k (a, x, b; u) 



n+1 



where 



sup k (a, x, 6; u) = max {x — a, b — x} = 

x£[a,b] 



b— a 



a + b 



Karamata [12] proved the following theorem. 

Theorem 8 Let g, w : [a, b] — > R be integrable on [a, b] and suppose m < g (t) < 
A'/ and < c < to (£) < Ac /or £ G [a, 6] and some constants m, M, c and X. If 
G and A (g, w) are defined as 



G:= 



1 



b — a 



h ,, , , \ g(t)w(t)dt 

g (t) dt and A (g, w) :- Ja B 



j° a w{t)dt 



(49) 



then 

Xm(M-G) + M(G-m) A , , m(M - G) + \M (G - m) 

X(M-G) + (G-m) ^ A ^)< {M -G) + X(G-m) ' (50) 

Using the above theorem of Karamata, the third inequality in (45) may be 
improved. 

If we associate k" +1 (a, x, b; w) , as defined by (47), with g (t) above, then 



< k (a, x, b; u) < v — max {x — a, 6 — x} 



b — a 



a + b 
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and 



Hence, from (46) 



G = ( f^- = — *— [ K n+1 (a, x, b; u) du. 
b — a b — a . I„ 



(n+ l)!*„+i (a,x,b;w) = \\Q n +i (^s^li 



< 



<" n+1 |Mli 



The last inequality follows from the fact that 

i-b 

,.71+1 



(6-a)i/«+ 1 -C(l) + AC(l) 



C (1) = / k" +1 (a, x, b; u) du < u n+l (b - a) 



4 Comparison of Ostrowski and Trapezoidal Re- 
sults 

The generalised weighted trapezoid kernel Q n (x,t; w) defined by (9) and (11) 
is a mapping Q n (-,-;w) : [a, b] - 



} n (x,t;w) 



( til 

n — '. 

(-1) 



(n-l)U t 

n r t 



(n-iy.j x 



a, b <G K, a < b where 



(u — t) w (u) du, t G [a,x] , n£N 



(t - w)" _1 w (u) du, te(x,b], neN ^ 5l ' 



n = 0. 



Here w (t) is a weight function with properties as ascribed earlier in the paper. 
An identity relating the generalised weighted trapezoid functional T n (a, x, b; f; w) 
as defined by (12) is given by (13). 

In our notation define the weighted Ostrowski functional O (a, x, b; f; w) by 

rb n 

9„ (a, x, b; /; w) := w (t) f (t) dt-^Ek (a, x, b; w) / (fe) (x) , (52) 

J a , n 



fe=0 



where 



Ej. (a, x,b;w) = — / (u — x) w (u) du. 



(53) 



Let the kernel K n (x,t;w) be such that K n (-,-,w) : [a,b\" 



a,b e 



452 



P.CERONE.J.ROUMELIOTIS 



a < b and w (•) a given weight function, then 



K n (x,t;w) 



1 



(n-l)U a 



(t — u) n w(u)du, t£[a,x), neN 





(-1)" 
(n-l)lj , 



t = x 



(u — t) w (u) du, t € (x, b] , n£N 



x, t e [a, b] . 



(54) 

Matic et al. [13] show that for /(") (•) continuous and of bounded variation on 
[a, b] then for x € [a, b] 



6„ (a, x, 6; /; w) = (-1)" +1 / n n+1 (x, t; w) <tf™ (t) . 



e (n) 



(55) 



Further, for /(") (•) absolutely continuous on [a, b] then df^ (t) = f {n+l) (t) dt, 
giving the identity 



6„ (a, x, b; /; W ) = (-1)" +1 / K n+1 (x, t; w) f (n+ ^ (t) dt 



(56) 

J a 

from (55). 

The bounds for \T(a,x, b; /; w)| and |@ (a, X, 6; f;w)\ depend on the be- 
haviour of |Q n +i (x,t;w)\ and \K n+ \ (x,t;w)\ respectively 

The following lemma gives sufficient conditions for the bounds on \T (a, x, b; f; w) 
and |0 (a, x, b; /; w) | to be equal. 

Lemma 6 For w : [a, b] — > (0, oo), J w (t) dt < oo and w (£) symmetric about 
the respective midpoints for t G [a, a;) and t G (x, 6] then, 

\Q n (x,a + x — t;w)\ , t€[o,i), 

t = x (57) 

|Q n (a;, x + b — t;w)\ , t e (x, 6]. 
An interchange of Q n and K n in (57) is valid under the same conditions. 
Proof. Consider for t € [a, x) 



\K n (x,t;w)\ = < 



(n — 1)! \Q n (x, a + x — t; w) 



ra-l 



a+x — t 



[u — (a + x — t)] w (u) du 

(t — v) n w (a + x — v) dv 
{n- 1)1 \K n (x,t;w)\, 
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provided w (a + x — v) = w (v) , that is, w ( 9 4^ — z) = w ( 9 4r 1 + z) for z e 
[a, x). A similar argument gives the result for t G (x, 6]. We note that if £ varies 
from c to d, then T = c + d — t varies from d to c. Thus the interchange of K n 
and Q„ in (57) is valid and the equivalent expression to (57) holds. ■ 

Remark 5 A consequence of Lemma 6 is that if w (t) = a, a constant, then 
the Lebesgue norms giving the bounds for \T (a, x, b; /; w)\ and |0 (a, x, b; /; w)\ 
are equal. In particular, the unweighted case w (t) = a = 1 produce the same 
bounds as shown in Cerone [4j- Weights such as 

, 2/1 



f (t-^) z ", teM 



iy n (£) 



i = .r 



,2n 



(t-^±b) n , te{x,b], neN 



anrf 



t-^L te[a,x) 



W (t) = { 0, 



t = x 



\t- 



x+b | 



, te(x,6], 



would produce the same bounds for the trapezoidal and Ostrowski functions de- 
fined by (13) and (52). 
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Abstract- Several numerical schemes are developed for obtaining approximate solu- 
tions to an initial boundary-value problem for the second-order parabolic partial differen- 
tial equations (PDEs) with an integral condition replacing one boundary condition. The 
space derivatives in the PDE are approximated by finite difference replacements. The 
solution of the resulting system of first-order ordinary differential equations (ODEs) sat- 
isfies a recurrence relation which involves a matrix exponential function. The accuracy 
in time is controlled by choosing several subdiagonal Pade approximants to replace this 
matrix exponential term. Numerical techniques are developed to compute the required 
solution using a splitting method, leading to algorithms for sequential and parallel im- 
plementation. The algorithms are tested on a model problem from the literature. The 
central processor unit (CPU) times needed are also considered and are compared . 

KEYWORDS: Two-Dimensional Diffusion-Numerical Integration- Sequential and 
Parallel Algorithms- Stability-Pade Approximant-Boundary Integral Condition- Central 
Processor Time- Partial Differential Equations. 

1 Introduction 

Two-dimensional parabolic equation with an integral condition replacing one bound- 
ary condition arises in many important applications in heat transfer [1, 2, 3, 4, 5, 6, 7], 
control theory [8], medical science [9], and thermoelasticity [10, 11, 12]. So, recently much 
attention has been given in the literature to the developement, analysis and implementa- 
tion of accurate methods for the numerical solution of time-dependent partial differential 
equations with a boundary integral condition. 

The purpose of this article is to present very efficient parallel methods for solving the 
following two-dimensional time-dependent diffusion equation 

du d 2 u d 2 u 

dt dx 2 dy 2 ' 
with initial condition 

u(x,y,0) = f(x,y), 0<x,y<l, (2) 
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and boundary conditions 

u(0,y,t) = g (y,t), < t < T, < y < 1, (3) 

u{l,y,t) = gi {y,t), 0<t<T, < y < 1, (4) 

u(x, 0, t) = h {x)fi{t), < t < T, < x < 1, (5) 

«(a;,l,i) = /ii(x,t), 0<t<T, < x < 1, (6) 
and the integral condition 

-1 rs(x) 



u(x,y,t)dxdy = m(t), 0<x,y<l, (7) 

o Jo 

where /, go, g\, ho, hi, s and m are known functions, while the functions u and // are to 
be determined. 

The boundary condition (5) is variable separable, with spatial dependence given by 
h (x) and time dependence given by f/,(t). 

The existence and uniqueness of the solution of this non-classic problem has been 
studied in [3]. 

An overview of this paper is as follows: 

Numerical schemes for the solution of ( 1 )-( 7 ) are described in Section 2. Develope- 
ment of numerical methods which will be based on the Pade approximants is presented 
in Section 3. A discussion on the sequential and parallel algorithms is given in Section 4. 
The method of incorporating ( 7 ) with fi unknown is described in Section 5, and numer- 
ical results for a test problem produced by the method developed, are given in Section 6. 
Section 7 concludes this paper with a brief summary. 

2 The Numerical Solution with Dirichlet Boundary 

Conditions 



We divide the domain [0, l] 2 x [0,T] into an M 2 x A^ mesh with spatial step size 
h = 1/M in both x and y directions and the time step size k = T/N respectively. Grid 
points (xi,yj,t n ) are given by 

Xi = ih, i = 0,1,2,..., M, (8) 

y 3 = jh, j =0,1,2,. ..,M, (9) 

t n = nl, n = 0,l,2,...,JV, (10) 

in which M is an even integer. We use w™ • and fi n to denote the approximations of 
u(ih,jh,nl) and [x{nl), respectively. 

The solution vector U n will be ordered in the following form: 



T Tl~l ( Th ft Tii 71/71 71 71 Tl \ J- ( *| *| \ 

U — V u l,l; u 2,l) • • • -> U M -l,li U l,2i U 2,2i ■ ■ ■ )%-l,2) • • • : U 1,M-1: ■ ■ ■ i U M-l,M-l) ■ l 11 ^ 

The numerical methods suggested here are based on two ideas: Firstly, the method of 
lines semi-discretization approach will be used to transform the model partial differential 
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equation into a system of first-order linear ordinary differential equations, the solution 
of which satisfies a certain recurrence relation involving matrix exponential terms. The 
developement of numerical methods will be based on Pade approximations to such expo- 
nentials. A suitable rational approximant will be used to approximate such exponentials 
leading to an algorithm which may be parallelized through a partial-fraction splitting 
technique. This technique is used to approximate the solution of the two-dimensional 
diffusion equation, at interior grid points. Secondly, a highly accurate numerical integra- 
tion scheme [13] is used to approximate the unknown function /x(t), using the integral 
condition ( 7 ). 

The space derivative in (1) will be replaced by their second-order central difference 
approximation given by 



d 2 u 
dx 2 



h 2 {u(x — h, y, t) — 2u(x, y, t) + u(x + h, y, £)) + 0(h 2 ), 



(12) 



d 2 u 
dy 2 



h 2 (u(x, y — h,t) — 2u(x, y, i) + u(x, y + h, £)) + 0(h 2 ), 



(13) 



as h — » 0. 

Applying (1) to all the (M — 1) interior mesh points of the square at [0, 1] x [0, 1], 
at time level t n = nl, with the space derivatives replaced by ( 12 ) and ( 13 ), leads to a 
system of (M — 1) first-order linear ordinary differential equations of the form [16] 



dU(t) 
dt 



with initial condition 



AU(t)+ip(t), t > 0, 



U(0) = f, 



(14) 



(15) 



in which the matrix A is of order (M — 1) and will be split into the constituent matrices 
B,C which commute such that A = B + C. 

The vector ip(t), of order (M — 1) , arises from the use of the time dependent boundary 
conditions (3)-(6) in (12) and (13). 

The matrix B arises from the use of ( 12 ) in ( 1 ), it is block diagonal with tridiagonal 
blocks and has the form: 



B = h 



-2 



Q 









Q 









': 


Q 





Q 


... 






Q 



(16) 



where Q is the tridiagonal matrix of order M — 1 given by 
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Q 



-2 1 
1-210 

; 1-21 





-2 1 
1 -2 



(17) 



The matrix C arises from the use of ( 13 ) in ( 1 ), it is block tridiagonal with diagonal 
blocks and has the form: 



-21 I 

I -21 I 

; i -2i i 



C = K 







where / is the identity matrix of order M — 1. 



-21 I 
/ -21 



The (M — 1) eigenvalues of the matrix A are real and negative and are given by 



2 F 



Xij = -4:h- 2 \sin 2 —- + sin 2 ^—], i,j = 1, 2, . . . , M - 1. 

1,3 L 2M 2M J 

Solving the system of ODEs ( 14 ) subject to the initial condition ( 15 ) gives 
U(t) = exp(tA)f + f exp[(t - s)A]^(s)ds, t > 0, 



which satisfies the recurrence relation [17] 



t+i 



U(t + l) = exp(lA)U(t) + / exp[(t + / - s)A]ip(s)ds, t = 0,l, 21, . 



(19) 



(20) 



(21) 



in which I is a constant time step in the discretization of the time variable t > 0, at the 

points t n = nl{n = 0, 1, 2, . . .). 



3 Solution at the First Time Step and the Pade 

Approximant 

Following [17] and using the trapezoidal rule for evaluating the quadrature in ( 21 ), 
gives the following second-order formula: 

U(t + l)=exp(lA)U(t) + -[ip(t + l) + exp(lA)il}(t)], t = 0,1,21,31, . . . (22) 

Replacing the matrix A by B + C and using the commutativity of B and C, ( 22 ) 
may be written as 
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I 



U(t + l) = exp(lB)exp(lC)U(t) + -[tf;(t + l)+exp(lB)exp(lC)ip(t)], t 



0,1,21,31,41,.... 

(23) 

The developement of the numerical methods will be based on making appropriate 
approximations in this recurrence relation. Higher-order Pade approximants are popularly 
employed for such exponentials. Methods based on the use of these approximants are of 
high accuracy in time and, in the case of the subdiagonal Pades, have good stability 
properties. 

The pade approximant to the exponential function e , where is a real scalar, has the 
form 



Rb,d{0) 



(24) 



Q b (ey 

where Pd{@) and Qb{0) are polynomials of degrees d and b, respectively with real coeffi- 
cients, in each of which the constant term is unity. The polynomials Pd{0) and Qb(&), for 
the approximants to be used in Section 4 are given in Table 1. 



{b,d)Pade 


Pd(0) 


Q b (9) 


(2,0) 


1 


i-e + °i 


(2,1) 


1 ^ 3 


1 2e , e 2 

3 T 6 , 


(3,0) 


1 


1 fl _i_ e 6 3 


(2,2) 


1 + - + - 
x ~ 2 ~ 12 


1-* + ^ 

2 12 


(2,3) 


1 _L 3e _|_ 3e _|_ ei 
L ~r 5 "T 20 J 60 „ 


1 20 i e 5 

5 "^ 20 


(2,4) 


1 , 26* , e 2 i e 3 , e 4 

t 3 ~ l ~ 5 ~ l ~ 30 ~r 360 


l-i + l 

3 ~ 30 



Table 1: Pade approximant to e e 

Note that the [d,b] denote the Pade approximation to e e . High-order stable subdiag- 
onal (b > d) Pade methods can be generated which are particularly efficient on machines 
utilizing several concurrent processors. 

It will be assumed that Qb{0) has b\ real zero and 26 2 complex zeros occuring in 
complex conjugate pairs so that b = b\ + 2&2- So Rb,d{0) can be expanded in partial 
fraction form, resulting in approximating the matrix exponential function in ( 23 ) by 

exp(A) ~ J£ Wi (A - aiy 1 + 2 J2 Re[ Wl {A - c i /)]" 1 .(25) 

i=l i=6i+l 

The above expansion is particularly useful in parallel computing environments because 
it can be used to apportion the work of solving the corresponding linear algebraic systems 
to processors operating concurrently. Consequently, the solution of ( 14 ) can be imple- 
mented on a machine with bi + b-i processors through parallel implicit Euler-like schemes, 
as described in Section 4. We can also construct parallel algorithms based on diagonal 
[b, b] Pade approximations. 

The numerical method to be developed in this report is based on the use of the 
subdiagonal [1/2] Pade approximant given by 
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exp(lA) « (I - \\A + \l 2 A 2 ) \l + \lA), (26) 

3 6 6 

for the matrix exponentials in the recurrence relation ( 23 ) to give the 0(h 2 + Z 3 ) uncon- 
ditionally stable scheme [18], 

u(t + l) = {I-\lB + \l 2 B 2 ) \l + \lB) 

3 6 6 

(i -\ic + \i 2 c 2 ) \i + hc)u(t) 

6 6 

(/ - \lC + \l 2 C 2 ) \l + \lC)m + Vi(t + /)]. (27) 

6 6 

4 Sequential and Parallel Algorithms 

The implementation of the sequential algorithms resulting from pre- multiplying the matrix 
inverses in ( 27 ) involves using higher powers of the block matrices B and C, thus 
requiring considerable amount of CPU time and computer storage as reported in [14]. 
We can overcome this difficulty using the parallel algorithms. The expansion ( 25 ) has 
great importance in the parallel computing environment in that it can be used to solve 
a corresponding linear algebraic system on processors operating concurrently. Using this 
expansion, parallel algorithms may be implemented on machines with b\ + 2&2 processors 
(only). 

The partial-fraction of (I - \IB + \l 2 B 2 y l (I + \IB) leads to : 

(/ - \lB + \l 2 B 2 ) (I + \lB) ~ [h(I - Zl lBY l + k 2 (I - z 2 W)-\ (28) 

6 6 

So we can express ( 27 ) in its partial-fraction splitting form [18] 

u(t + i) = [h(i - Zl iBy l + k 2 (i - z 2 iBy 1 ][k 1 (i - Zl icy l 
+ k 2 (i-z 2 icy 1 ]u(t) + -ij(t + i) 

+ l -[k 1 (I-z 1 lB)- 1 + k 2 {I-z 2 lB)- 1 ] 

[h(i - z.icy 1 + k 2 (i - zjcy 1 ]^), (29) 

where z\ and z 2 are the poles of the (2, 1) Pade approximant and are given by: 

Zl = z 2 = 0.33333333272613 + 0.23570226861019i, (30) 

and k\ and k 2 are the constants resulting from the partial fraction decomposition and are 
given by: 

fci = k 2 = - - V2t, i = y/=l. (31) 
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Note that the closed form expressions for z\ and z 2 are not available, so we have used 
their approximate values ( 30 ). 

U(t + 1) in ( 29 ), the solution vector at time t = (n + 1)/, may now be obtained via 
the parallel algorithms using two processors [18]: 

Processor 1 : 



(/ - Zl lC) Wl = hU(t) 

Pi = 2Re(wi) 

(I - z 1 W)w 2 = k l p 1 

p 2 = 2Re(w 2 ), 



(32) 



Processor 2 



(/ - Zl C)w 3 = hi>(t) 

p 3 = 2Re(w 3 ) 

(I - z 1 lB)w 4 = k x p 3 

Pi = 2Re(w±), 



(33) 



U{t + l)=p 2 + -[p A + ij{t + l)}. 



(34) 



The intermediate vectors pi,Wi,i = 1,2,3,4, need not to be stored once U(t + /) is 
computed at each time step. The matrices (/ — Z\IB) and (I — zilC) are decomposed 
into lower and upper triangular (LU) forms only once. These LU products are then "fed" 
to the two processors in order to compute the intermediate vectors Zi,i = 1,2,3,4, using 
forward and backward substitutions. 

The principal part of the local truncation error arising from the use of the (6, d) Pade 
approximant is 

,d q+l u, 1 , 0r/ <9 4 w N ,<9 4 -u N 



C q+l l" 



dt 



9+1 



u h % x >- 



dy 



:)], 



(35) 



q = b + d, where the error constants C q+ \ for a selection of second, third and fourth-order 
methods (in time ) are given in Table 2. 



order(q — b + d) 


Pade 


Error Constants (C q+ \) 


2 
2 
3 
3 

4 
4 


(1,1) 
(2,0) 

(2,1) 
(3,0) 

(3,1) 
(4,1) 


-l 

12 

i 

6 

i 

72 

24 
1 

-480 
1 

120 



Table 2: The error constants C„ 
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5 The Computation of //(£) 

In this section the numerical integration procedure to be used to approximate the 
unknown function //(£) using the boundary integral condition (7) will be described. 

The presence of an integral term in a boundary condition can greatly complicate the 
application of standard numerical techniques. The accuracy of the quadrature must be 
compatible with that of the discretization of the differential equation. 

In the following the double integral in (7) is approximated using the fourth-order 
Simpson's composite "one-third "rule. 

Consider the integral 

H(x,t) = / u(x,y,t)dy. (36) 

Jo 

Then application of Simpson's composite "one-third " rule gives 



M/2 (M/2)-l 



f H(x,t)dx^-(H + 4Y,H 2l ^ + 2 J2 H 2t + H M ), (37) 

in which 

r2lih rs(ih) 

H { = u(xi,y,t)dy+ u(x i: y,t)dy, (38) 

JO J2l t h 

where 

U = [s(ih)/2h], (39) 

and [ ■ ] represents the integer part of the argument. 
Substituting in the second integral of (38) 

Zi = y/h - 2k (40) 

yields 

s(ih) rSi 

u(xi,y,t)dy = h u(xi,Zi,t)dzi, (41) 

2Uh JO 

where 

6i = (s(ih)/h)-2li. (42) 

Replacement of u in the integral with a quadratic interpolating polynomial the Newton's 
forward-difference formula) through the grid values concerned, gives 

eh fb 1 

u(xi, Zi, t)dzi = / [ui 2 ii + ZiAuioii + ^Zi(zi - 1) A Ui2ii]dzi + 0(h ), (43) 

o Jo 2 

where 

Am, )2 |, = U it 2li+1 - U it 2li, (44) 

and 

A 2 U it 2h = Ui,2k+2 ~ 2Mj,2/ l+ l + Ui,2h- (45) 

Integrating (43) and collecting the like terms means that 
u h h— 1 

Hi ~ -[u ifi + 4j2u l ,2 ] -i + 2j2u i , 2j + u l ,2 h + 35 t (l-35 i /A (46) 

6 3=1 3=± 

+ S t 2 /6)u h2k + 3^ 2 (1 - Si/3)ui M . +1 + {Si 2 /4)(2Si - 3)^, 2/i+2 ]. 
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Putting 

V(t) = i H(x,t)dx, (47) 

Jo 

and using the approximation 



M/2 (M/2)-l 

(m + 4 

3 



r"^-(m + 4Y,H-_ 1 + 2 £ Hl + H n M ), (48 



8=1 1=1 

then gives 

,2 (M/2)-l M (M/2)-l 

r( 

i=0 i=l i=l 

Note that 



w "-yl E «o,o + 4E<-i,o + 2 E <o + «Sf,o)+« B - (49) 



h r 1 
" 2 



/i" / s(x)dx + R n , (50) 

Jo 



where i? n is the summation in v n excluding the values at the boundary y — 0. 
We then approximate /i™ +1 by means of 



m n+l _ R n fl 

~h~7T 



/i w+1 = ' h \i , Z . / s(x)rfx^0, (51) 

3 JO S (^J«^ ■'O 



from which the boundary values along y = may be computed at time t n+ \ using the 
boundary condition (5). 

6 Numerical Results 



The numerical method described in the previous section was applied to the following 
problem: 

Consider (l)-(7) with 

f(x,y) = exp(x + y), (52) 

9o(y,t) =exp(y + 2t), (53) 

gi (y,t) = exp(l + y + 2t), (54) 

ho(x) = exp(x), (55) 

hi(x) = exp(l + x + 2t), (56) 

/i(t) = exp(2t), (57) 

m(t) = (4exp(exp(l) / '4) - 4exp(l/4) - exp(l) + l)exp(2i), (58) 

s(x) = exp(x)/A, (59) 

for which the exact solution is 

u(x,y,t) — exp(x + y + 2t). (60) 
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New Scheme 


BTCS Scheme 


X 


y 


Exact u 


Error 


Error 


0.1 


0.1 


9.025013 


-5.0 x 10~ 4 


-2.0 x 10~ 3 


0.2 


0.2 


11.023176 


-6.0 x 10~ 4 


-3.0 x 10~ 3 


0.3 


0.3 


13.463738 


-6.0 x 10~ 4 


-2.0 x 10~ 3 


0.4 


0.4 


16.444647 


-5.0 x 10~ 4 


-3.0 x 10~ 3 


0.5 


0.5 


20.905243 


-6.0 x HT 4 


-4.0 x 10" 3 


0.6 


0.6 


24.532530 


-7.0 x 10~ 4 


-3.0 x 10~ 3 


0.7 


0.7 


29.964100 


-8.0 x HT 4 


-3.0 x 10~ 3 


0.8 


0.8 


36.598234 


-6.0 x HT 4 


-2.0 x 10~ 3 


0.9 


0.9 


40.447304 


-6.0 x HT 4 


-4.0 x 10~ 3 



Table 3: Results for u with T = 1.0, h = 0.02, I = 0.0001 



The results for uf, with h 



0.02, I 



i 



10000 



at T = 1.0, using the method discussed in 



Section 4 and the (5,1) fully implicit finite difference BTCS formula of [3] are shown in 
Table 3. 

The CPU time for the new technique was 53.1 sec while the CPU time for the fully 
implicit finite difference method of [3] was 1553 sec. 



The results obtained for \i with h = 0.02, I 



i 



10000 ' 



using the new method developed 



in this paper and the (5,1) fully implicit finite difference BTCS technique to solve the 
example non-classic boundary value problem are shown in Table 4. 



t 


Exact fj, 


0.1 


1.221403 


0.2 


1.491825 


0.3 


1.822119 


0.4 


2.225541 


0.5 


2.718282 


0.6 


3.320117 


0.7 


4.055200 


0.8 


4.953032 


0.9 


6.049647 


1.0 


7.389056 



New Scheme 
Error 
7.0 x 10" 4 
6.0 x 10~ 4 
6.0 x 10~ 4 
6.0 x 10~ 4 
7.0 x 10~ 4 
7.0 x 10~ 4 
8.0 x 10~ 4 
7.0 x 10~ 4 
6.0 x 10~ 4 
5.0 x 10~ 4 



BTCS Scheme 
Error 
8.0 x 10~ 3 
7.0 x 10~ 3 
7.0 x 10~ 3 
6.0 x 10~ 3 
5.0 x 10~ 3 
6.0 x 10~ 3 
6.0 x 10~ 3 
5.0 x 10~ 3 
4.0 x 10~ 3 
5.0 x 10~ 3 



Table 4: Results for // with h = 0.02, I = 0.0001 



Note that the results showed that the new technique is about thirty times faster than 
the (5,1) fully implicit finite difference BTCS scheme. 



7 Summary 



In this paper a parallel algorithm was developed and applied to the two-dimensional 



PARRALEL SCHEMES FOR TWO-DIMENSIONAL PARABOLIC... 467 



parabolic equation with a boundary integral condition replacing one boundary condition. 
The algorithm, which may be implemented on a parallel architecture using two processors, 
requires only the applications of tridiagonal solvers at every time step. This technique 
worked very well for two dimensional diffusion with an integral condition. For the model 
problems considered, the parallel algorithm ( 34 ) was found to be about thirty times 
faster than the fully implicit finite difference BTCS scheme of [14]. A comparison with 
the standard explicit finite difference scheme for the model problem clearly demonstrates 
that the new technique is computationally superior. The numerical results obtained by the 
new scheme discussed in this article on a test problem from the literature, give acceptable 
results and suggest convergence to the exact solution when h goes to zero. The new 
method discussed in this article is readily adaptable to similar three dimensional problems. 
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1. INTRODUCTION 

In rapidly developing area of nonlinear theory of differential equations, 
many important results have been obtained by the use of nonlinear functional 
analysis based on fixed point theory. Over the last thirty years, random op- 
erator theory has grown into a full fledged research area. The various ideas 
associated with random fixed point theory can be used to form a particularly 
elegant approach for the solution of nonlinear random systems (see [4]). Ran- 
dom fixed point theorems for random contraction mappings on Polish spaces 
were first proved by Spacek [21] and Hans [9,10]. Subsequently Bharucha-Reid 
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[5] (see also Mukherjea [16]) have given sufficient conditions for a stochas- 
tic analogue of Schauder's fixed point theorem for a random operator. Itoh 
[12,13] introduced random condensing operators and considerably improved 
their results. Sehgal and Water [19,20] considered Browder-Fan type random 
operators and as a consequence obtained a stochastic generalization of the well 
known Rothe fixed point theorem. Recently Papageorgiou [17], Lin [14], Xu 
[23], Beg [1], Tan and Yuan [22], Beg and Shahzad [2,3] and many other au- 
thors have studied the fixed points of random maps. The aim of this note is to 
study in continuation the fixed points of asymptotically nonexpansive random 
maps ( i.e. ill conditioned random problems ) and to construct a Mann type 
iteration scheme [15] which converges to a fixed point of the aysmptotically 
nonexpansive random map. 

2. PRELIMINARIES 

A Banach space A is called uniformly convex if for each e > there is a 
function S : (0,2] — ► (0, 1] such that if ||.x|| < d, \\y\\ < d , ||.x — y\\ > e then 
II 2^ || < (1 — 3(i))d- Moreover 8 is increasing. In the sequel all normed spaces 
are assumed to be taken over the real. 

Let (fi, E) be a measurable space ( E = sigma algebra ) and E a nonempty 
subset of a Banach space A. A mapping £ : Q — ► X is measurable if and only 
if £ _1 (£/) G E for each open subset U of X. The mapping T : Q x E ^ X is & 
random map if and only if for each fixed x G E, the mapping T(.,x) : Q — ► X 
is measurable, and it is completely continuous if for each uj G Q, the mapping 
T(u, .) : E —*■ X is completely continuous. A measurable mapping £ : Q — ► X 
is a random fixed point of the random map T : Q x E — ► A if and only 
if T(a;, £(u;)) = £(w) for each w G fi. For a subset i? of A, let diam(E) 
denote the diameter, cl(E) the closure, co(E) the convex hull, clco(E) the 
closed convex hull of the set E, S(xo,r) the spherical ball centred at Xq with 
radius r i.e. 5'(a;o,r) = {x G A : ||.Xo — x\\ < r} and T n (uj,x) the nth iterate 
T(w,T(«;,T(«;,...T(a;,a;)...))). 
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Definition 2.1. Let E be a subset of a Banach space X and T :VtxE —> E 
be a random map. 

(i). The random map T is said to be nonexpansive if for arbitrary x,y G E, 

||T(w,a:) -T(u,y)\\ < \\x - y\\, 
for each u £ Q. 

(ii).The random map T is called asymptotically nonexpansive if for each 
x,y G E, 

\\T n (tu,x)-T n (tu,y)\\ <k n \\x-y\\, 
for each u G Q and where {k n } is a decreasing sequence of real numbers such 
that lim k n = 1. 

(iii). The random map T is said to be uniformly L-Lipschitzian (L > 0) if 
and only if for all x, y G E, 

\\T n (cu,x)-T n (uj,y)\\ <L\\x-y\\ 
for each u G Q. 

Theorem 2.2. [24, Theorem 2]. Let p > 1, r > be two fixed real 
numbers. Then the Banach space X is uniformly convex if and only if there 
exists a continuous, strictly increasing and convex function g : R + — ► R + , 
g(0) = 0, such that, 

||ckf + (1 - a)y\\ p < a\\x\\ p + (1 - a)\\y\\ p - W^,(a)^(||a; - y\\), (1) 

for all x, y in S(0, r), a G [0, 1] where W p (a) = a(l - a) p + a p (l - a) . 
For more details and other related results we refer to [2,5,7,24]. 

3. THE RESULTS 

Our principal result in this section is the following random version of a 
generalization due to Goebel-Kirk [8] of Browder [6] fixed point theorem. 

Theorem 3.1. Let E be a nonempty closed bounded and convex subset 
of a separable uniformly convex Banach space X. Let T : Q x E — ► E be an 
asymptotically nonexpansive random map. Then T has a random fixed point. 

Proof. Let £ : Q — ► .E 1 be a measurable map. Let iy G fi be fixed, and let 
the set -R^iu) consists of those real numbers r for which there exists an integer 
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k such that 

£n(n^5(T>,^)),r)^. 

Since diam (E) G R^i w ), therefore R^( w ) 7^ 4>. Let r$ = inf R^ w ). For each 
e> 0, 

(u- =1 (n£*S (T>, £(«;)), r + 5))) n £, 

are nonempty and convex. The reflexivity of X further implies that 

n £>0 (el (ur =1 (n£*S (r>, eH), r + e)))) ni?^. 

Let 7/ : fi — > -B be a measurable map so that for each wefi, 

v (w) g n £>0 (el (ur =1 (n£*S (T>, £(«;)), r + 5)))) n £, 

(for existence of rj see [11]). 

Let a > 0, then there exists an integer n such that if n > n , then 

\\r](w) — T n (w, £,(w))\\ < r + a. 

Suppose that the sequence {T n (w,r](w))} does not converge to r)(w) (i.e. rj is 
not a random fixed point of T) . 

Then there exist e > and a subsequence {T ni (w, r](w))} of {T n (w, r](w))} 
such that 

\\T n '(w,rj(w))-rj(w)\\ >£, i = 1,2,3, -•- . 

For m > n, 

\\T n (w,r)(w))-T m (w,r)(w))\\ < k n \\rj{w) -T m - n (w,r/(w))\\, 

where A; n is the Lipschitz constant for T n obtained from the definition of as- 
ymptotic nonexpansiveness. Suppose r$ > and choose p > so that 

(l-<5(— ^— ))(r + p)<r . 

Select p so that ||t7(w) — T p (w,r](w))\\ > e, and also so that 

. p. 
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If no > p is sufficiently large, then m > no implies that 

\\n{w)-T m -v{ W) i{w))\\<ro+ P -. 
It further implies that 

\\T*(w,ri(w)) -T m (w,t(w))\\ < k p \\n{w) -7™->,£H)|| < r + p, 

also 

\\n(w)-T m (w,aw))\\<ro + p. 

Thus by uniform convexity of X, if m > n , then 

|| "H + 7"."H> - T">, ««,)) || < (l - * (^) ) (,„ + P) < r„, 

and this contradicts the definition of r . Hence we conclude r = or 
T(w,n(w)) = n(w) for each w G H But r = implies {T ra (w,^(w))} is 
a Cauchy sequence yielding lim^oo T n (w, £(u>)) = n(w) = T(w, n(w), for each 
w G fl. Therefore for each w G Q, 

n £>0 (ci (ur =1 (n™ =k s (T n (w, eH), r + £ )))) flE = {77H}. 

Hence n is a random fixed point of T. This completes the proof. 

Let X be a separable normed space, i? be a nonempty convex subset of X 
and L > 0. Let T : Q x E ^ E be a, uniformly L-Lipschitzian random map. 
We define a Mann [15] type random iteration process as follows. 

Let £o : ^ —* E be any measurable map. Then the following measurable 
maps are iteratively defined as follows: 

Vn ( W ) = P n T n ( W ,Uw)) + (1 " /3n)£nH, (2) 

and 

£n+i(w) = a n T n (w, ri n (w)) + (1 - a n )£ n (w) (3) 

for each w G fi and n = 0, 1, 2, 3, • • • where {a n } and {/?„} satisfy e < 1 — 
a n , 1 — f3 n < 1 — e for all n and some e > 0. Since £" is nonempty convex subset 
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of X therefore the sequences of measurable functions {£„} and {f] n } are well 
defined. 

Lemma 3.2. Let T : Q x E ^ E be a, uniformly L-Lipschitzian random 
map. Then for all n = 0, 1, 2, • • • . 

\\Uw)-T{w,Uw))\\ < \\T n (w,CnH) -Uw)\\ 



L + 3L 2 + 2L 6 ) HT^Kfn-iH) -Cn-i(w) 



(4) 



Proof. For all n = 0, 1, 2, • • • , we have 



\\Uw)-T{w,Uw))\\ < WUw) ~T n (w,U™))\ 



+ \\T n (w,U™))-T(w,U™) 



(5) 



Now, 



\T n (w,Uw)) T{wMw))\\ = \\T{w,T^\w,Uw))-T{w,Uw))\ 

< L\\T n -\w,Uw)) -CnMW 

< L\\T n -\w,Uw)) ~ {otn-iT^^w^-iiw)) + (1 - a n _i)Cn-i(^)} || 



<L 



a n - 1 \\T n - 1 (w,Uw))-T n - 1 (w, Vn ^ 1 (w 



+ (l-a n _ 1 )\\T n - 1 (w } U™))-Zn-i(w)\ 



<L 



L\\Uw)-Vn-l(w)\\ + \\T n -'(w,U™))-tn-l(w)\\ . (6) 



Also, 



\€n(w) ~ 77„_i(iy)|| < Un{w) ~ £ n _i(ttf)|| + ||£ n _i(w) - 7/ n _i(w)|| (7) 



More over, 



\£n-l{w) - r) n -i(w)\\ = \\£ n -l(w) - {!3 n -lT n (w,£ n -i(w)) 

+(i-/5 n _ 1 )e„-iH}n 
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n— 1 / 



< iit— i (w,e«-i(^)) - e»-i(w)ii- (s) 

And, 

Un(w) -f n _i(iu)|| = ||a n _iT n_1 (w,r7 n _i(u7)) + (1 - a„_i)f n _i(iu) -£ n _i(io)|| 

= a n _i||T n_1 (u7,T7 n _i(u7)) -^ n _i(iy)|| 

+ ||T n - 1 (^,en-i(^))-T"- 1 ( U ;,r /n _ 1 H)|| 

< l^n-lN -T^^^n-lH)!! +L\\U-l(w) -Vn-l{w)\\ 

<Un-l(w)-T n -\w,Cn-l(w))\\ 

+L||e„_iM - {ft-ir- 1 ^,^^)) + (i - /3„_i)en-iM} II 

< (i + L)||£„-iM -t^k^-iH)!!- (9) 

In equalities (7), (8) and (9) imply, 

UnH-Vn-l(w)\\ < (2 + L)U n - 1 (w)-T n -\w,Cn-lH)l (10) 

Also note that, 

||T"->,£nW)-£n-lW|| 

+ilT«- i (^,e„-iH)-en-iHii 

(by (9)) 

< {1 + L{1 + L)} \\T n -\w, Cn-lH) ~ Cn-lHW (11) 

Inequalities (6), (10) and (11) imply, 

\\T n (w,Uw))-T(w,Uw))\\ 
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<L 



L(2 + L)\\T n - 1 (w,^. 1 (w))-^. 1 (w) 



+ {1 + L(l + L)} IIT"" 1 ^,^-!^)) " £n-lH| 

= {L + 3L 2 + 2L 3 )||T™- 1 («;,£n-iM) - Cn-iHIl (12) 

Inequalities (5) and (12) yield the required inequality (2). This completes 
the proof. 

Theorem 3.3. Let X be a separable uniformly convex Banach space; 
(f) 7^ E C X, and £" closed bounded and convex. Let T : Q x E 1 — > E be 
a asymptotically nonexpansive random map with {/c n } satisfying 

oo 
n=l 

for all n and some e > 0. 

Let £o : ^ — > -E 1 be a measurable map and define 

t,n+i(w) = a n T n (w,i n (w)) + (1 - a n )i n (w) 

for n > 0. Then 

lim \\i n {w) -T(w,£ n (w)\\ = 0. 

Proof. Set M = [diam(E)] . From Theorem 3.1, T has a random fixed 
point £ (say) (i.e. £ : fi — ► i? a measurable map and T(w, £(w)) = £(w) ). 
Using inequality (1), with r > 1 such that E 1 C 5(0, r), we have, 

Un+i(w) -aw)\\ p = \\a n T n (w,Uw)) + (1 " a n )Uw)-Z(w)\\ p 

< a n \\T n ( W ,uw)) -ewr + (i -ardwuw) -enir 

-w P KMIIU™)-^KSnH)ll) 

< a n ^ne„H -eHir + (i -«n)iienH -enir 
-^ P K)^(iienH-T n («;,^H)ii) 
<nenH-eHii p + K^-« n )M 
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-W p (a n )g(\\Uw) -T n (w,U™))\\) (13) 

Since W p (a n ) > 2e p+1 ( See Xu [24]), thus from (13), we obtain 

0<2e p+1 g(UnM-T n (w,U^))\\) 

< Unfa) ~ £H \\ P - Un+1 (W) ~ £(w) \\ P ~ <*nM (A* - 1) . 

It implies that 

m 

2e^ 1 Y / 9(UnM-T n (w,U^))\\) 

n=0 

m 

< ll&M ~ £H \\ P + (1 " e)Mj2(K ~ 1) (14) 

n=0 

iFrom hypothesis on {£„(u>)}, inequality (14) implies that the series on the 
left converges. Therefore 

\\mg(\\Uw) -T n (w,Uw))\\) = 0. (15) 

Since g is continuous at and strictly increasing therefore (15) implies that 

hm UnH-T n ( W ,CnH) 11=0. 

Lemma 3.2. now further implies 

lim \\£ n (w)-T(w,£ n (w))\\ =0. 

n^oo 

This completes the proof. 

Now we are able to show that {£ n } converges to the random fixed point of 
T:ttxE^ E. 

Theorem 3.4. Let E be a nonempty closed bounded and convex subset 
of a separable uniformly convex Banach space X. Let T : Q x E — ► E be 
a completely continuous asymptotically nonexpansive random map with {k n } 
satisfying k n > 1 and Y^=\ (&£ — 1) < oo; r = max{2,p}; £ < a n < 1 — £ 
for all n and some e > 0. Choose £o : ^ — > -E a measurable map, and define 
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€n+i{w) = a n T n (w,£ n (w)) +(1 - a n )£ n (w), n > 0. Then {£„} converges to 
some random fixed point £ of T. 

Proof. Clearly, cIco({^q(w)} UT(fl x E)) is a compact subset of E contain- 
ing {in{w)}. Therefore there exists a measurable map £ : Q —> E and some 
subsequence {£n;(w)} of {£n(w)} such that lim^oo £ ni (w) = £(w). But T is 
continuous and lhm^oo ||£ n (w) ~~ T(w,£ n (w)) = by Theorem 3.3. Therefore 
T(w,£(w)) = £(iu). It implies that 

Un+i(w) -£(w)\\ = \\a n T n (w,Cn(w)) + (1 - a n )£ n (w) - f (to) || 

< a n ||T n («;,enH) - £{w)\\ + (1 - a n )||£„H - £{w)\\ 

= a n \\T n (w,Uw)) - T(w,£(w))\\ + (1 - a n )U n (w) - SHII 

< (a n k n + (1 - «„)) ||£ n (w) - £(io)|| 

for all n > 0. Since IT^ =1 A; r i converges and linij^ 00 £ ni (u') = £(iw). Therefore 
lim n ^ 00 £ n (u') = £(w). Hence {£„} converges to the random fixed point £ of 
T. 

Remark 3.5. Schu [18, Theorem 1.5] is a particular case of the determin- 
istic analogue of our Theorem 3.4. 
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1. Introduction 

The classical core-Walras equivalence result [13, Theorem II-2.1] is stated for 
a finite-dimensional commodity space, and a space of agents (fi, E, fi) rep- 
resented by a non-atomic, positive, countably additive measure space. This 
celebrated result of Equilibrium Theory has been extended in several direc- 
tions: in particular ([1, 3, 28]) discuss and face the problem of extending it 
to the case of strongly non- atomic finitely additive measures. The proof of 
the classical result is based upon an idea of Aumann, and makes use of the 
geometrical and topological properties of the multivalued integral of a suitable 
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multifunction ranging on the commodity space. 

The Aumann integral of a Banach-valued multifunction with respect to a fi- 
nitely additive measure has been considered in [21] and [22]: in [21] the in- 
vestigation concerned integrands with compact and convex values, while [22] 
examined the case of integrands with weakly compact and convex values; (see 
also [26] for a survey on this topic). 

Recently, in [23] we have examined the case of multifunctions of the type 

FM = [I»-eM]U{0}, (1) 

where T is a simple multifunction with values in the hyperspace of closed and 
convex values of a Banach space X, e £ L l (X) and fi is non atomic and 
countably additive. 

Integrands of this form are those that occur in the core-Walras equivalence: 
the assumption on T has a meaningful interpretation from the point of view 
of the economic model. 

The properties of the multivalued integral are usually achieved applying 
the classical Liapounov Theorem, together with its infinite dimensional ver- 
sions. It is well known that Liapounov Theorem does not extend to finitely 
additive measures: weakened forms of it in this more general setting have 
been obtained by several authors [18, 6, 7, 29, 8, 2, 3]. A survey of the most 
important results can be found in [19]. 

What we obtained in [23] was that, if e has Liapounov indefinite integral, then 
F has convex Aumann integral. The proof of the result was based upon the 
following result 

Theorem A ([23] Theorem 3.7) Let X and Y be two Banach spaces, with X 
satisfying the (RNP), \x : S — ► IRq a non-atomic countably additive measure, 

n 

f = / XilE t a Y -valued, simple function, and n^ = f ed\x an X -valued Lia- 

pounov measure. Then, setting m = J fd\x, the range of the pair {n\,n<i) is 
convex and compact inYx X w . 

This paper is concerned with the Aumann integral for integrands of the form 
(1), but when \x is simply finitely additive: in this setting Theorem A above 
is in general false; a counterexample can be obtained by means of the results 
in [8]. 
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However we shall show, through a completely different path, that the convex- 
ity of the Aumann integral of F can be reobtained also in the finitely additive 
setting, when the commodity space X is a Banach lattice and the indefinite 
integral of e satisfies suitable assumptions. 

2. Preliminaries and definitions 

Throughout this paper X will be a reflexive, separable Banach lattice, X + its 
positive cone. With X* we denote the topological dual and with X±, X* the 
unit balls of X and X* respectively. We denote by X w the space X equipped 
with its weak topology. 

Let O be a set, E a a-algebra of subsets of O and \x : E — ► [0, +oo[ a finitely 
additive bounded measure. In accordance to [11, Chapter III] we denote by 
L^(X) the space of A-valued, ^-integrable functions /. When l=lwe shall 
simply write L 1 . Throught the paper we will use the symbol fi to denote a 
scalar measure, while with the symbol m we denote a vector valued one. 

Definition 1 A finitely additive vector measure m : E — ► X is called 
Liapounov if, for every E £ E, m(T,E) '■= {m(A),A £ En E} is convex 
and weakly compact for every E £ E. Since we have assumed that A is a 
reflexive Banach space it is enough to assume that m(T,E) is bounded, closed 
and convex for every E £ T,. If, for every E £ T,, m(T,E) is only convex, we 
will say that misa convex measure. If, for every E gE, there exists B £ Eg 
such that m(B) = ^m(E), we will say that misa semiconvex measure. 

We remind that for a scalar finitely additive, bounded measure /U : E — > [0, +oo) 
the conceps of strong continuity, semiconvexity and Liapounov are equivalent 
[8, 18]; where [i is strongly continuous if for every e > there exists a finite 
decomposition of S7, A; L £ E, i = 1, . . . , n such that /J,(Ai) < e. 

For a finitely additive vector measure we will denote by \m\ the variation 
of m, defined for every E £ T,, by: 

\m\(E) = sup 2, ll m (^-i)ll 
n A t en 
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where the supremum is taken over all the finite decompositions II of the set 
E. Moreover we denote by ||m|| its semivariation, given by 

\\m\\(E) = sup{|x* m\(E), x* £ X* , \\x*\\ < 1}. 

It is known that: 

sup{||m(A)||,A€ Y.DE} < \\m\\(E) < 4sup{||m(A)||, A € Sn£}, 

see for example [9, Proposition 1.1]. 

In the framework of [27] we will make use of the Stone extension; more 
precisely Q will be the Stone algebra associated to £ and r : £ — ► Q the 
Stone isomorphism. 

With Ji : Q a — > [0, +oo[ we will denote the extended measure of //, where Q a 
is the ex-algebra generated by Q . Observe that if \x is strongly continuous then 
its Stone extension JL is non atomic and therefore Liapounov. Moreover, by 
[12], if / £ L l (X) then it is possible to define its Stone extension as a map 
/ G L^-(X) such that for every J?eE, 



fdfi = / fdjl (2) 

E Jt{E) 

where the left hand side is defined in accordance to [11, Chapter III]. 
As a consequence if / € L}(X) has Liapounov indefinite integral, its Stone 
extension / has the same property. It was also showed in [15] that 
p-almost everywhere. 



Let m : S — > X be a vector- valued finitely additive measure. We say that 
m is s-bounded if lirrin^oo m(A n ) = for every sequence {A n ) n of pairwise 
disjoint sets in S. 

Definition 2 A positive finitely additive measure a : S — ► [0, oof is a 
control for m if and only if ||ra|| ~ o~, in the sense that for every e > there 
exists 5 > such that the following implications hold: 

• if a(A) < 5 then ||m||(A) < e; 

• if ||m||(A) < 5 then a {A) < e. 
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A control a is said to be a Rybakov control if there exists a functional x* G X* 
such that a = \x*m\. 

Remark 1 In [10, 25] the following equivalences were proved: a finitely ad- 
ditive measure m is s-bounded if and only if there exists a control for m if and 
only if there exists a Rybakov control for m. If m is also of bounded variation 
then its variation is equivalent to a Rybakov control for m. The following proof 
of this equivalence was communicated to us by one of the referees. If a = |x*m| 
then obvioulsy a <C \m\. To prove the converse, for every F = {x*, . . . , x* } in 
X* let r)p := ViLi l x | m l be the lattice supremum of the finitely additive mea- 
sures |x*m|, . . . , |x*m|. Since all the r]p are dominated by \m\, then they are 
uniformly s-bounded and therefore they are uniformly <7-continuous. Hence the 
set-wise supremum |m|(i?) = sup{r]p(E),F C X*,F finite } is er-continuous. 
So, in this case, ||m|| ~ \m\. Moreover if m and a are countably additive then 
the e — 5 absolute continuity is equivalent to — absolute continuity. 

3. A Liapounov result 

We will now show that if m is X + -valued and s-bounded then it admits a 
Rybakov control of the form a = y*m for some y* G (X*) + . 

Lemma 1 If m : S — » X + is a s-bounded finitely additive measure then 
there exists y* £ (X*) + such that \\m\\ ~ y*m. 

Proof: Let a = |xo m | be a Rybakov control for m, and let y* = \xq\ in the 
Banach lattice X* . 

Then, easily, |xq(x)| < y*{x) for every x G X + . Let now e > be fixed and 
consider 5 according to the absolute continuity of ||ra|| with respect to a; let 
A G E be such that y*m{A) < 5; we want to show that ||m||(.A) < e. Indeed 
let LT be an arbitrary finite partition of A, since m is X + -valued we have 

Y^ \x* m(B)\ < ^2 y*m(B) = y*m(A) < 5. 
Ben Ben 

Taking the supremum with respect to LT we have o~(A) < 5, which in turn 

yields ||ra||(.A) < e. 

Conversely we prove now that y*m <C ||wi||. Let e > be fixed and consider 
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S = £||y*|| _1 . If \\m\\(A) < 5 then y*m{A) < \\y*\\ \\m\\{A) < \\y*\\ 5 < e. □ 
We will show now that, also in the finitely additive case, if m is of bounded 
variation it is possible to find a control which is equivalent to the variation of 
m. 

Proposition 1 Ifm:Ti—> X + is a finitely additive measure with bounded 
variation, then there exists y* £ (X*) + such that \m\ ~ y*m. 

Proof: By Lemma 1 there exists y* 6 (X*) + such that y*m is a Rybakov 
control, namely y*m ~ ||"i,||. Since m is of bounded variation then, by Remark 
1, ||m|| is equivalent to \m\. This concludes the proof. □ 

Proposition 2 Let m : £ — ► X + be an s-bounded finitely additive measure. 
The following are equivalent: 

2.1 m is semiconvex; 

2.2 m admits a filtering family, namely for every B € S there exists a filter- 
ing family {-Bt}ie[o,ii such that 

a) B = 0, Bi = B and, if t < t' , then B t C B t >; 

b) m{Bt) = tm{B), for every t G [0, 1]; 

2.3 m is a convex measure. 

Proof: 2.1) =^> 2.2). Let B e S be fixed. With a standard argument it 
is possible to construct a filtering sequence (Bt)t,t £ Q(2) which satisfies 
conditions a) and b), see for example [6, Lemma 2.1]. Let now t G]0, 1[ be 
fixed, with t ^ Q(2), and let (p n )n, (<ln)n be two sequences in Q(2) such that 
p n T t and q n { t. Put B' t = U n B Pn , B'{ = n n B qn and note that B[ C B' t '; 
hence 

m(B' t ) > sup m(B Pn ) = tm(B) = mtm(B qn ) > m(B'J) > m(B' t ). 

Therefore we can choose for instance Bt = f) n B qn . 

2.2) => 2.3). Let A.BeSbe fixed and let t G [0, 1]. As in [7, Theorem 2.4] 

let 

C t = (B\ A) t U(AHB)U(A\ B)i_ t , 
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where the families {(B \ A)t}t, {{A \ B)t}t are the filtering families for B \ A 
and A \ B respectively. We have that Co = A, C\ = B and 

m(C t ) = tm(B\A)+m(AnB) + (l-t)m(B\A) = 
= tm(B) + (1 - t)m(A) 

2.3) =>• 2.1). It is obvious. D 

The equivalence between 2.1) and 2.2) in the finite dimensional case had 
already been obtained in [7] . 

We shall suppose now that m is the indefinite integral of a function e & 
L l (X + ) which satisfies the following assumption: 

(h) jj, is a control for m. 

Remark 2 A sufficient condition for (h) is, for example, the following: 

cssinf llell > r > (3) 

The assumption (3) has a very meaningful interpretation in the application 
to the economic model namely, when e is the initial endowment. We could 
label it as minimal entrance feee (m.e.f.) because from the economic point of 
view this means that all the consumers, except a set of measure zero, have a 
minimal granted support r. 

Observe also that m.e.f. assumption is stronger than the condition (h). 
For instance, consider in the interval [0,1], \i : S — > [0, 1] defined as 

V( A ) = / ^r-^ dx > 
Ja^Vx 

where dx is the Lebesgue measure and A is a Lebesgue measurable set and 
take e(x) = 2\fx. 

The measure m = J ed/j, coincides with the Lebesgue measure and admits \jl 
as a control, but e does not satisfy m.e.f. assumption. 

We want now to prove the announced result: 
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Proposition 3 Let fj, be a s-bounded, non negative, finitely additive mea- 
sure, and let e £ L 1 {X + ) be such that m = ed\i is semiconvex and n is a 
control for m; then the pairs (m,/z) and (m,xfi) are convex for every x £ X. 

Proof: The idea of the proof is analogous to that given in [7, Propositions 2.4 
and 2.5]. First of all we prove that the pair (m, \m\) is semiconvex. 
Let E £ £ be fixed and consider the filtering family {-Et}tg[o,i] associated to m 
and E. If \m\(Ei/ 2 ) = 2~ 1 |to|(E') then we have semiconvexity. Otherwise let, 
for example, |m|(-Eq/ 2 ) < 2~ 1 |m|(.E) and therefore \m\(E\Ei/ 2 ) > 2~ 1 \m\(E). 
If we apply Proposition 2 to m we find the filtering families {At}t, {Bt}t as- 
sociated to E1/2 and E \ E 1 / 2 respectively. We set 

C t = A t UB 1 - t . 

By construction, for every t 

m(Ct) = tm(E 1/2 ) + (1 - t)m(E \ E 1/2 ) = l -m{E). 

We prove now that \m\ (Ct) is a continuous function in t. Let y*m be a Rybakov 
control for m as in Proposition 1. Therefore, for e > fixed, let 5 > be that 
of the absolute continuity of \m\ with respect to y*m. Let t,s £ [0, 1] be such 
that \t — s\ y*m{E) < 5; suppose for instance t < s. The set Ct A C s is given 
by (A s \ A t ) U (£i_i \ Bi_ s ). Since y*m{C t A C s ) = \t - s\y*m{E) < 5 we 
have that 

| \m\(C t )-\m\(C s ) | < |m|(QAC a )<e. 

Hence, by continuity, as |m|(Co) < 2~ 1 \m\(E), while |m|(Ci) > 2~ 1 |m|(-E), 
there exists t e]0, 1[ such that \m\(C t ) = 2~ 1 |m|(£'). 

Since jjl is a control for m this implies also the continuity of t h^ /i(Ci) and 
then the semiconvexity of (m,[x). Finally the convexity of the pair (m, /i) 
follows by Proposition 2 if we consider E = X x M, E + = X + x [0, oof, and 
the fact that the finitely additive measure (m, fx) is s-bounded. The convexity 
of (m, x/i) is an immediate consequence of the definition and of the convexity 
of (m,fj,). 
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4. Applications to multivalued finitely additive in- 
tegral of non convex integrands 

Throughout this section, and similarly as in [23], we will adopt the following 
notations: 

(i) (Q, E) is a measurable space and fi : £ — > [0,oo[ a s-bounded finitely 
additive measure which is also strongly continuous. 

(ii) {cf{X), h) and (cwk(X), h) are the families of non empty, convex, closed 
(non empty, convex and weakly compact respectively) subsets of X with 
the Hausdorff distance. 

(iii) r = ^2i=i Ci^-Ei is a simple multifunction with closed and convex values 
with Ei n Ej = for i / j; 

(iv) e G L 1 (X + ) is such that \{E) := / edfi is a convex finitely additive 

Je 
measure; 

(v) G=(T-e), F = Gu{0}. 

We examine now the problem studied in [23] when the measure with re- 
spect to which we integrate is only finitely additive. This case is not a mere 
extension of the countably additive one. It has applications for instance in fi- 
nitely additive economies which were introduced first in [1] by Armstrong and 
Richter: they explained why their model of a large economy is more realistic 
than the countably additive one. The same model was also extensively studied 
in [3], [4]. 

We begin with the bounded case, namely we assume that Cj G cwk(X) for 
every i = 1, . . . ,p. Thanks to the Radstrom Embedding Theorem ([24]) and 
to the Stone isomorphism we can consider the multifunctions T and G where 

r = i^Cii KEi) , (4) 

p 

G = T^~e = Y,Cd h{Er )-e. (5) 

i=l 
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Thanks to (v) above, F certainly admits totally measurable selections and 
so we can define the Aumann integral as usual, using the finitely additive 
/z-integrability, [11, Chapter 3]: namely 

- Sp is the set of all /i-integrable selections of a multifunction F, that is 

S F = {/ £ L^(X) : f(u>) G F(uj) /j, — almost everywhere}; 

- the Aumann integral of F is defined by 

(A)-J E Fd fi =Ufd fi , feS F \. 

The finitely additive case here considered is quite different from the case con- 
sidered, for instance, in [21] and [22], where the existence of suitable selections 
depends upon the topological properties of the values of F. 

Moreover, we shall denote by M-p, Mq the finitely additive multimeasures 
defined as the indefinite Aumann integrals of V and G respectively. 
By Rx(M/.\) we shall denote the range of Mr.\ that is 

Rx{M { . ) )= |J M ( .)(£). 

As r takes values in cwk(X), its Aumann integral is convex and weakly 
compact: in fact, for every Eg S, we have that 

V dii(E n Ei) C (A)- [ Tdn C (A) - I Tdjl = (6) 

~[ JE Jh(E) 

V 

= J2 C ^(h(EnEi)), 
i=i 

where the first inclusion and the last equality can be obtained by [9, Corollary 
8] since in the proof of this corollary the countable additivity is not required, 
while the middle inclusion can be obtained analogously to the proof of [21, 
Theorem 5.1] (note that, since T is simple, we do not need the compactness 
of the values of T) . 

Using these facts, statements analogous to those of [23, Propositions 3.1, 
3.2 and 3.4], with the strong continuity of \x replacing the non atomicity, hold 
for r also in the finitely additive case. Namely 



A NOTE ON A LIAPUNOV-LIKE THEOREM... 491 



(a) My{E) = < / sd/i, s is a simple selection of T >; 

(b) Rx(M r ) e cwk(X); 

(c) M G (E) = (A) - / Tdfi - edfie cwk(X) for every EeE. 

Je Je 

In [23], Theorem 3.7 allowed us to obtain that Rx(Mg) £ cwk(X) when jjl 
is non atomic and countably additive. In the finitely additive case, Theorem 
3.7 which is, in fact, a L yap ounov- type statement, does not hold in its com- 
plete extension (a counterexample is easily derived from [8, Theorem 4.4]). Its 
weakened finitely additive version, that is Theorem 1 below, will play a similar 
role in achieving the convexity of Rx(Mc) in our case, although weak com- 
pactness is not assured in general. These two results, despite their similarity, 
do not compare: here we have only finite additivity but we have to assume 
the equivalence between jjl and to; the proof of Theorem 1 completely differs 
from that of the quoted countably additive version, and is heavily based upon 
the results of Section 3. 

Theorem 1 Let X be a Banach lattice, Y a Banach space, /j, a strongly 
continuous finitely additive measure and e G L l (X + ) be such that the finitely 
additive measure A = j edfi is convex and admits fi as a control. If f is a Y- 
valued simple function then setting to = / /d/i, the range of the pair (to, A) 
is convex inYxX. 

Proof: We know that if / = Y^=i C i^-Ei for some finite decomposition of 0,, 
{Ei, . . . , E p } then 

(to, A)(E jB ) = (m, X)(T, EnEl ) + . . . + (to, A)(S jBnjBp ). 

So it is enough to note that, from Proposition 3, for each i = 1, . . . p, 

{m, X)(T,EnEi) is convex and to is a multiple of \x on T,EnEi- E 

As an immediate consequence we obtain that 

Corollary 1 Let fj = X^=i z )^Ei-,j = 1>2 be simple and measurable func- 
tions with values in X and let A as in Theorem 1. Then setting nij = j fjdfi—X 
we have that m\,mi, (mi, 777-2) are convex finitely additive measures. 



492 A.MARTELLOTIASAMBUCINI 



Proof: Applying Theorem 1, one easily deduces that mi and 777,2 are convex 
finitely additive measures. We shall prove that the finitely additive measure 
(7771,777,2) is convex. It is enough to prove that from Proposition 3, for each 
i = 1, . . .p, (777,1, 7772) (S^n^J is convex for every E eE. 

By Proposition 3 the pair (A,/i) is convex. So, for every D\,D 2 S ^EnEi and 
for every t G]0, 1[ there exists C £ T,EnEi such that 

/i(C) = i/i(£»i) + (l-t)/z(D 2 ) 
A(C) = iApi) + (l-i)A( J D 2 ). 

Since on S^n^ ttt-j = z*/j we have, for j = 1, 2, 

m »(C7) = 4/j(C)-A(C) = 

= te}/i(A) + (1 - *)*>(A2) - <A(£>i) - (1 - t)A(£> 2 ) = 
= tm j (D 1 ) + (l-t)m j (D 2 ); 

thus 

(mi,m 2 )(C) = t{m ll m 2 ){D 1 ) + (1 - t)(mi,m 2 )(I>2)- n 

Because of Theorem 1 the set Rx(Mg) is convex. In fact 
Theorem 2 Under the previous assumptions Rx(Mg) is convex. 

Proof: It is possible to prove it in an analogous way as in [16, Lemma 7]. 
We report the proof for completeness. We recall that G = V — e, with V = 
Yh=i CdEi and A = / ed\x. 

Let ip = 2^f=i Zjlfij be a fixed simple selection of T. Consider the finitely 
additive selection measure v of Mq, defined as 

u(-) = / ipdfi — A. 

Fix xi, x 2 in Rx{Mg) and t g]0, 1[. Then there exist two sets A±,A 2 £ £ and 
two simple selections of V, f\ and f 2 (fj = Y^=i ^Ae^J = lj 2), such that 



/ fjd/i-XiAj), 

J A; 



J = 1,2. 
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We put rrij = J fjdfj, — A, for j = 1,2. By Corollary 1 mi,m2, (mi, m 2 ) are 
convex. Then there exist B\^B 2 ,B 3 such that B\ C A\ \ A2, B2 C A2 \ A\ 

and 5 3 C ii fl A 2 and 

mi(Bi) = tej^! \ A 2 ); m 2 (£ 2 ) = (1 - ^2(^2 \ A x )\ 

(m 1 ,m 2 )(B 3 ) = t(m 1 ,m 2 )(A 1 C\A 2 ). 

Set B A = (ii fl i 2 ) \ B 3 and B = uf =1 Bi. We will show that for a suitable 
selection v* of Mq, v*{B) = tx\ + (1 — t)x 2 . 
Indeed, for every E £ Y,, define 

V *(E) = v(E\B) + m 1 (En(B 1 UB 3 )) + m 2 (En(B 2 UB 4 )) = 
= [ ydn- \{E \B)+ I hdn + 

JE\B JEn(B 1 UB 3 ) 

+ f hdn - H E n B) = 

JEn(B 2 UB 4 ) 

= / {f ■ 1e\b + /i ■ l£n(BiUB 3 ) + h ■ lEn(B 2 uB 4 )) dfi- X(E). 
Je 



Then the finitely additive measure v* is a selection of Mq and if we evaluate 
v* on the set B we obtain 

v*(B) = m 1 (B 1 UB 3 ) + m 2 (B 2 UB 4 ) = tm 1 (A 1 \A 2 ) + tm 1 (A 1 nA 2 ) + 
+ (1 - t)m 2 (A 2 \ Ax) + m 2 (A 1 n A 2 \ B 3 ) = 
= tx 1 + (l-t)x 2 . □ 

Moreover we have that 

Theorem 3 If G is a multifunction as before then cl{Rx(Mc)} = 
Rx{Mq). 

Proof: From (c), (5), (6) above and [22, Theorem 5.1], (which holds in our case 

without compactness of values of G), for every E £ T,,Mq(E) C Mq{t{E)). 

For the converse inclusion, if x € Rx(Mq) then there exists a set H £ Q a 

such that 

x £ Mq{H) = {A)- [ Tdji - A (H) 
Jh 
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and then x = y^Xi/j,(H n r{Ei)) — A (H) for some Xi G Q, i = 1, . . . ,p. 

i=l 

Since ^ is Frechet-Nikodym dense in Q a , for every e > there exists a Q - 
measurable set B such that 

HHAB) < JL, 

for some k > max{||e||i, h(C±, {0}), . . . , h(C p , {0})}. Then 

v v 

^xip(B n T (Ei)) -X(B)\\<J2 \\ Xi \\jl(HAB) + |A \(HAB) < e. 



p 
\x 

i=i i=i 



Since Z? is a C? -measurable set, there exists E £ Y, such that t{E) = B; let us 

put 

p p 

x £ := Y^ x iUB n r(^)) -\(B) = J2 X M E n ^) " A (^)- 
i=i i=i 

We have that x e £ Rx(Mg) and ||x — x e || < e. 

Hence Rx(Mq) C cZ{.Rx(-^g)} an d again applying (c), the equality follows. □ 

Now, as in [23, Theorem 3.15], one can show that, if G(u>) for all 
uell, then, for every E £ T,, 

(A)- [ Fdfi = R x (M G \xnE) (7) 

Je 

and therefore it is convex. 

Let us now consider the unbounded case, namely assume that Cj £ cf(X), i 

1, . . .p. For each integer n, consider 

r„H = r( w ) n nx l5 F n ( w ) = (r n (w) - e(w)) u {o}. 

As in [23, Proposition 3.16] one shows that, for every £6S 

(A) - f Fdn = (J (A) - / F n d/x. (8) 

The sequence on the right hand side of (8) is increasing. Hence, if £" G, 

(A) — / Fdfj, is the union of an increasing sequence of convex sets, and therefore 

Je 
is convex. 
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For the general case, that is when some values of G contain 0, similarly to 
[23, Theorem 3.18] denote by f^o the set {w:0£ G(lo)}; the Aumann integral 
splits into two parts: 

(A) - [ Fdfi = {A)- I Fdfi + {A)- I Fdfi 
Je JEnn JE\n 

It only remains to derive the convexity of the first summand by noting that 
F has convex values in 0,q. 

In conclusion we have obtained the following: 

Theorem 4 Let (fi,E,//) be a non negative finitely additive measure space 

with fj, strongly continuous, X a Banach lattice, T : 0, — > cf(X) a simple 

multifunction and e G L l (X + ) generate a semiconv ex finitely additive measure 

which admits [i as a control. Consider F = (T — e) U {0}. Then, for every 

E € X, (^4) — / Fdfi is convex. 
J E 

We shall now derive a result similar to [23, Theorem 3.18]. Indeed, by 
means of Theorem 3, and the results in the countably additive case, [23, The- 
orem 3.18], we obtain the following 

Theorem 5 Let (fi,E,//) be a non negative finitely additive measure space 

with \x strongly continuous, X a Banach lattice, T : 0, — ► cf(X) a simple 

multifunction and let e £ L 1 (X + ) generate a Liapounov indefinite integral 

which admits [i as a control. Consider F = (T — e) U {0}. Then, for every 

E G S, (^4) — / Fdfi is a convex set which is the union of an increasing 

J E 
sequence of convex and relatively weakly compact sets. 

Proof: The line of the proof is somewhat analogous to that of [23, Theorem 
3.18]. As before consider 

r n (w) = r(w)nnXi, G n (w) = r n (w)-e(w), F n = G n (u) U {0}. 

and the Stone transforms of T n , G n , denoted by T n and G n respectively. From 
[23, Theorem 3.14] Rx{Mq ) is weakly compact and, since from (7), 



(A)- [ F n dfi = Rx(M GnU 
Je 
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by Theorem 3 above, we have that cl < / F n dn\ is convex and weakly compact 
for each n. Hence the already proved equality 



(A) - f Fdti = \J (A)- f F n dii 

JE „ JE 



shows the assertion. □ 

Remark 3 Note that Theorem 5 above is the only result that can be par- 
tially derived from the countably additive case [23] by means of the Stone 
extension. The other results in this section, despite the similarity of the state- 
ments, cannot be obtained in this way since the assumptions here and in [23] 
do not compare. For example if we take \i the Lebesgue measure on [0, 1], a 
non negative integrable function e such that /x(supp e) G]0, 1[ then A = J edfi 
is automatically Liapounov but \x ft; A. Viceversa the example which can be 
derived from [8] verifies the hypothesis on \i and A given here but A is not 
Liapounov. 
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